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MARKERS FOR EVALUATING ESTROGENIC ACTIVITY 

TECHNICAL FIELD 

The present invention relates generally to the 
5 identification of compounds that have estrogenic and/or 
antiestrogenic activity, such as cell-type specific 
estrogenic agents. The invention is more particularly 
related to screens for use in evaluating compounds for 
cardioprotective and bone protective estrogenic effects, and 
10 to methods for using such compounds to prevent or treat 

diseases such as cardiovascular disease, osteoporosis and 
cancer. 

BACKGROUND OF THE INVENTION 

15 Estrogen is a therapeutic component of hormone 

replacement therapy (HRT) , which is commonly used to treat 
symptoms associated with the onset of menopause. This 
hormone has further been shown to provide protection from 
cardiovascular disease, which is a leading cause of death 

20 among American women. While the mechanism of this effect is 
not understood, estrogens appear to decrease the 
accumulation of low-density lipoprotein and cholesterol in 
coronary arteries, and to increase the level of high-density 
lipoprotein. Estrogens also appear to function as 

25 vasodilators and antioxidants, which may further provide 
some cardioprotective effect. Other positive results of 
estrogen therapy include protection from osteoporosis, as 
well as beneficial neuronal effects. However, in spite of 
the benefits of estrogen treatment, many women are concerned 

3 0 about the threat of increased incidence of breast and/or 
uterine cancer. 

A drug that provides the many benefits of estrogen with 
minimal or decreased risk of cancer would alleviate such 
fears and improve the health of many women. In attempts to 

35 identify such a drug, compounds called Selective Estrogen 
Receptor Modulators (SERMs) have been pursued. SERMs are 



1 



compounds that can have estrogenic activity m one tissue, 
while having no estrogenic activity or acting as an estrogen 
antagonist in another tissue. For example, such an agent 
could have estrogenic activity in the bone and 
cardiovascular system (thus preventing osteoporosis and 
cardiovascular disease) , but act as an antiestrogen in 
breast and uterus (thus reducing the risk of breast and 
endometrial cancer) . The use of SERMs clearly offers 
promise as breast cancer preventative agents, and such 
compounds have the potential to reduce the occurrences of 
uterine cancer, osteoporosis and myocardial infarction. 

A few candidate SERMs are in clinical use or in some 
stage of clinical development. Tamoxifen, for example, has 
been used for a number of years in cancer treatment and 
recently won FDA approval for use in breast cancer 
prevention in high-risk women. Raloxifene has been approved 
for use in prevention of osteoporosis. Both compounds were 
first identified as "antiestrogens" due to their estrogen 
antagonistic activity in the breast. However, studies of 
the role of these compounds in cancer prevention have lead 
to conflicting conclusions. Of particular concern is the 
effect of these compounds on uterine proliferation and 
endometrial cancer. In addition, the cardioprotective 
effects of both compounds need further evaluation. 

A significant limitation in the identification and 
evaluation of candidate SERMs is the lack of convenient 
screens. Currently, time-consuming and labor-intensive 
clinical trials are needed to assess the activities of an 
individual candidate. In order to identify a compound that 
has estrogen's positive effects on bone and the 
cardiovascular system, while protecting against estrogen's 
negative effects on breast and endometrial cancer, 
techniques are needed to provide information about a 
potential SERM's tissue specific molecular profile prior to 
in vivo and clinical studies. 



Accordingly, there is a need in the art for a system 
that permits a rapid assay for cell type-specific effects of 
estrogenic therapeutic agents. The present invention 
fulfills this need and further provides other related 
5 advantages. 

SUMMARY OF THE INVENTION 

Briefly stated, the present invention provides screens 
for use in evaluating compounds for cardiovascular effects, 

10 and methods for preventing and treating cardiovascular 
disease. Within certain aspects, the present invention 
provides methods for evaluating the cardiovascular effect of 
a candidate agent,, comprising the steps of: (a) contacting 
an estrogen receptor-enhanced vascular endothelial cell with 

15 a candidate agent; and (b) determining whether the candidate 
agent modulates expression of at least one estrogen- 
regulated marker, wherein the expression of the marker is 
previously determined to be induced by estrogen in estrogen 
receptor-enhanced vascular endothelial cells. In certain 

20 embodiments, step (b) may comprise the steps of: (i) 

detecting a level of an estrogen-regulated marker, and (ii) 
comparing the detected level of the estrogen-regulated 
marker with a level of the estrogen-regulated marker in the 
absence of the candidate agent; or the steps of (i) 

25 detecting a level of mRNA encoding an estrogen-regulated 
marker, and (ii) comparing the detected level of mRNA 
encoding the estrogen- regulated marker with a level of mRNA 
encoding the estrogen-regulated marker in the absence of 
estrogen- related compound. In certain other embodiments, 

3 0 the cell is transfected with a polynucleotide comprising a 
promoter for an estrogen-regulated marker, operably linked 
to a reporter gene. Within such embodiments, step (b) may 
comprise the steps of: (i) detecting a level of reporter 
protein activity, and (ii) comparing the level of reporter 

35 protein activity detected with a level of reporter protein 

in the absence of candidate agent, or may comprise the steps 



of: (i) detecting a level of reporter mRNA, and (ii) 
comparing the level of reporter mRNA detected with a level 
of reporter mRNA in the absence of estrogen-related 
compound. Certain methods evaluate the ability of the 
5 candidate agent to modulate expression of at least two 

estrogen-regulated markers (ERMs) . Preferred markers are 
those listed in Table I, i.e., ERM-1, ERM-2, ERM-3, ERM-4, 
ERM-5, ERM-6, ERM-7, ERM-8, ERM-9, ERM-10, ERM-11, ERM-12, 
ERM-13, ERM-14 , ERM-15 , ERM-16 , ERM-17, ERM-18 , ERM-19 , ERM- 

10 20, ERM-21 , ERM-22, ERM-23, ERM-24, ERM-25, ERM-26, ERM-27, 
ERM-2 8, ERM-29, ERM-30 , ERM-3 1 , ERM-32 , ERM-33, ERM-34 , ERM- 
35, ERM-36, ERM-37, ERM-38, ERM-39, ERM-40, ERM-4 1, ERM-4 2 , 
ERM-43, ERM-44, ERM-4 5 , ERM-46, ERM-4 7 , ERM-4 8 , ERM-4 9 , ERM- 
50, ERM-51, ERM-52, ERM-53, ERM-54, ERM-55, ERM-56, ERM-57 , 

15 ERM-58, ERM-60, ERM-61 , ERM-62, ERM-63, ERM-64, ERM-65, ERM- 
66, ERM-67, ERM-68, ERM-69, ERM-70, ERM-71 , ERM-72, ERM-73, 
ERM-74 and ERM-75. Preferably, the ERM is coded for by a 
message that hybridizes under stringent conditions to any 
one if SEQ ID NOs : 1-19. Preferred estrogen-regulated 

2 0 markers are those having sequences recited in any one of SEQ 
ID NOs : 1-19 . 

Within further aspects, methods are provided for 
identifying a selective estrogen receptor modulator (SERM) , ( 
comprising the steps of: (a) contacting at least two 

25 different cells that express an estrogen-regulated marker 

with a candidate SERM; (b) determining levels of expression 
of at least one estrogen- regulated marker in the cells; and 
(c) comparing the levels of expression with levels of 
expression of the estrogen-regulated marker (s) in cells 

30 following contact with estrogen, and therefrom determining 
whether the candidate SERM is a selective estrogen receptor 
modulator. The estrogen- regulated marker may be, for 
example, partially or fully encoded by one of the nucleic 
acids sequences set forth in SEQ ID NOs: 1-19, or by a 

35 fragment of one of the nucleic acid sequences set forth in 

SEQ ID NOs: 1-19, or by a nucleic acid that hybridizes under 



stringent conditions to one of the nucleic acid sequences 
set forth in SEQ ID NOs : 1-19, and the cells may be 
independently selected from the group consisting of breast, 
endometrial, cardiovascular, bone and central nervous system 
5 cells. Within certain embodiments, levels of at least two 
estrogen- regulated markers are determined. Certain methods 
further comprise the steps of: (d) contacting breast cells 
that express an estrogen- regulated marker expression with 
estrogen and a candidate SERM ; (e) determining levels of 

10 expression of at least one estrogen-regulated marker in the 
breast cells; and (f) comparing the levels of expression 
with levels of expression of the estrogen- regulated 
marker (s) in the breast cells following contact with 
estrogen, in the absence of candidate SERM. 

15 Within further aspects, methods are provided for 

identifying estrogen regulated markers, comprising the steps 
of: (a) contacting a cell that expresses an estrogen 
receptor with an estrogenic compound for a period of time ; 
(b) isolating RNA from said cells; (c) determining levels of 

2 0 expression of one or more RNA molecules; and (d) comparing 
the levels of expression with levels of expression of the 
RNA molecules in cells not contacted with estrogen, and 
therefrom determining whether the one or more RNA molecules 
are estrogen regulated markers. The level of expression of 

2 5 RNA molecules can be determined by, for example, RNA 

fingerprinting or by probing arrays, such as those provided 
by the Incyte Genomics UniGEM technology. 

Within further aspects, methods are provided for 
inhibiting the development of a cardiovascular disease in a 

3 0 patient; comprising administering to a patient an agent 

identified or a SERM identified as described above. 

Methods are also provided, within other aspects, for 
inhibiting the development of breast cancer in a patient; 
comprising administering to a patient a SERM identified as 
3 5 described above, wherein the SERM inhibits induction of 



estrogen-regulated marker expression by estrogen in breast 
cells . 

The present invention further provides, within other 
aspects, methods for inhibiting the development of 
5 osteoporosis in a pat ient ; comprising administering to a 
patient a SERM identified as described above. 

These and other aspects of the present invention will 
become apparent upon reference to the following detailed 
description and attached drawings. All references disclosed 
10 herein are hereby incorporated by reference in their 
entirety as if each was incorporated individually. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is an autoradiogram depicting the level of 

15 expression of vascular endothelial growth factor (VEGF) mRNA 
in VE-ERot cells following exposure to 100 nM 17 , p-estradiol 
for varying amounts of time, as indicated. Vehicle 
(ethanol) controls are shown in the lanes. GADPH 

expression is also shown for reference. 

2 0 Figure 2 is an autoradiogram depicting the level of 

prostacyclin I 2 synthase in VE-ERa cells following exposure 
to 100 nM 17 , fJ-estradiol for varying amounts of time, as 
indicated, in the lanes labeled "E2 " . Vehicle (ethanol) 
controls are shown in the "EtOH" lanes. GADPH expression is 

25 also shown for reference. 

Figure 3 is an autoradiogram depicting the time course 
of cyclin Al induction in estrogen receptor a-enhanced 
vascular endothelial cells, as determined by RNase 
protection analyses. Cells were treated by adding 0.1% 

30 ethanol (ctrl) or 100 nM 17-(5-estradiol (E2) for the 
indicated times. mRNA levels of cyclin Al and 
glyceraldehyde- 3 -phosphate dehydrogenase (GADPH) are shown 
as indicated. 

Figures 4A and 4B are autoradiograms illustrating RNase 
35 protection experiments with estrogen receptor a- and 



estrogen receptor (i-enhanced vascular endothelial cells. 
Cells were treated by adding 0.1% ethanol (ctrl) , 100 nM 17- 
p-estradiol (E2) , 1 |iM ICI 164384 (ICI) or 10 nM 
diethylstilbestrol (DES) , as indicated. In each case, the 
5 levels of various cyclins are shown, as indicated by the 
designations to the right of the autoradiogram. 

Figure 5 is an autoradiogram depicting the effect of 
varying amounts of 17-P~estradiol on the levels of various 
estrogen-regulated markers (as indicated) . The GAPDH 
10 control is labeled and DDST is diastrophic dysplasia sulfate 
transporter. 

DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is generally 

15 directed to screens for use in evaluating compounds for 
estrogenic effects, such as cell-type specific and/or 
cardioprotective effects. Compounds that may be identified • 
using such methods include those with mixed agonist and 
antagonist properties, which can be used in prevention of 

20 breast cancer while also having estrogen-like positive 
effects in bone and the cardiovascular system. 

The present invention is based, in part, on the 
identification of estrogen-regulated markers, which can be 
used to evaluate candidate agents for cell type-specific 

25 estrogenic activity and cardioprotective effects. Such 
markers may be used, for example, to obtain estrogen- 
regulated marker expression profiles in tissues of interest 
(e.g., breast, endometrium, bone, central nervous system and 
cardiovascular tissue) . Such profiles may be used to 

3 0 evaluate the estrogenic activity of a candidate agent in 
various cell types, thereby permitting rapid screening of 
candidate agents, for selective estrogen receptor modulator 
(SERM) activity. 

3 5 ESTROGEN- REGULATED MARKERS (ERMS) 



An "estrogen- regulated marker, " or "ERM," as used 
herein is a gene which has an altered level of expression, 
either increased or decreased, preferably at least two fold 
greater, in response to estrogen in estrogen receptor- 
5 expressing cells, such as estrogen receptor-enhanced 

vascular endothelial cells or mammary epithelial cells. As 
used herein, the term "estrogen receptor-enhanced" refers to 
cells that express an estrogen receptor (e.gr., estrogen 
receptor alpha (ERcc) and/or estrogen receptor beta (ER0) ) at 

10 a level that is sufficient to permit the cells to respond to 
estrogen using any assay that measures an estrogen response. 
Vascular endothelial and mammary epithelial cells, in 
culture, do not naturally express estrogen receptors at such 
a level. However, such cells may be rendered estrogen 

15 receptor- enhanced by transfection with one or more estrogen 
receptor genes, as discussed herein. Assays to detect an 
estrogen response include assays to detect enhanced levels 
of any estrogen responsive gene i.e. vascular endothelial 
growth factor (VEGF) mRNA following treatment with estrogen, 

20 as well as assays to detect the increases in the expression 
of an estrogen-regulated marker or ERM related protein as 
provided herein. 

The term "ERM related protein" as used herein refers to 
a polypeptide that possesses a similar or identical amino 

25 acid sequence to that of an ERM, or possesses a structure 
that is similar or identical to that of the ERM. As used 
herein, an amino acid sequence of a polypeptide is "similar" 
to that of an ERM if it satisfies at least one of the 
following criteria: (a) the polypeptide has an amino acid 

30 sequence that is at least 30% (more preferably, at least 

35%, at least 40%, at least 45%, at least 50%, at least 55%, 
at least 60%, at least 65%, at least 70%, at least 75%, at 
least 80%, at least 85%, at least 90%, at least 95% or at 
least 99%) identical to the amino acid sequence of the ERM; 

35 (b) the polypeptide is encoded by a nucleotide sequence that 
hybridizes under stringent conditions to a nucleotide 



sequence encoding at least 5 amino acid residues (more 
preferably, at least 10 amino acid residues, at least 15 
amino acid residues, at least 20 amino acid residues, at 
least 25 amino acid residues, at least 40 amino acid 
5 residues, at least 50 amino acid residues, at least 60 amino 
residues, at least 70 amino acid residues, at least 80 amino 
acid residues, at least 90 amino acid residues, at least 100 
amino acid residues, at least 125 amino acid residues, or at 
least 150 amino acid residues) of the ERM; or (c) the 

10 polypeptide is encoded by a nucleotide sequence that is at 
least 30% (more preferably, at least 35%, at least 40%, at 
least 45%, at least 50%, at least 55%, at least 60%, at 
least 65%, at least 70%, at least 75%, at least 80%, at 
least 85%, at least 90%, at least 95% or at least 99%) 

15 identical to the nucleotide sequence encoding the ERM. 

For a high degree of selectivity, relatively stringent 
conditions are used to form the duplexes, such as low salt 
or high temperature conditions. As used herein, "highly 
stringent conditions 11 means hybridization to filter-bound 

20 DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS) , 1 mM 
EDTA at 65°C, and washing in 0.1xSSC/0.1% SDS at 68°C 
(Ausubel F.M. et al . , eds . , 1989, Current Protocols in 
Molecular Biology, Vol. I, Green Publishing Associates, 
Inc., and John Wiley & Sons, Inc., New York, at p. 2.10.3; 

25 incorporated herein by reference in its entirety.) For some 
applications, less stringent conditions for duplex formation 
are required. As used herein "moderately stringent 
conditions" means washing in 0.2xSSC/0.1% SDS at 42 °C 
(Ausubel et al . , 1989, supra). Hybridization conditions can 

30 also be rendered more stringent by the addition of 

increasing amounts of formamide, to destabilize the hybrid 
duplex. Thus, particular hybridization conditions can be 
readily manipulated, and will generally be chosen depending 
on the desired results. In general, convenient 

35 hybridization temperatures in the presence of 50% formamide 
are: 42 °C for a probe which is 95 to 100% identical to the 



fragment of a gene encoding an ERM, 37°C for 90 to 95% 
identity and 32°C for 70 to 90% identity. 

In general, estrogen-regulated marker expression is 
detectably changed in an estrogen receptor-enhanced vascular 
5 endothelial cell or mammary epithelial cell treated with 
estrogen as described herein, whereas expression of the 
marker does not respond detectably to estrogen in such cells 
without estrogen-receptor enhancement. Estrogen receptor- 
enhanced cells may be derived from primary cells obtained 

10 from a human or non-human animal, or may be derived from a 
cell line. For example, primary vascular endothelial cells 
or mammary epithelial cells may be infected with an 
immortalizing retrovirus (i.e., a retrovirus that expresses 
an oncogene that leads to immortalization of infected 

15 cells) . Such immortalized cells have very low expression of 
estrogen receptors and no detectable estrogen response. 
Estrogen receptor enhancement may then be achieved by 
infection with a retrovirus that expresses recombinant 
estrogen receptor a or estrogen receptor p. Infected cells 

20 may generally be selected based on antibiotic resistance. 

The level of estrogen receptor mRNA may be confirmed using, 
for example, an RNase protection assay, and the level of 
estrogen receptor protein determined by Western analysis or 
immunohistochemical techniques. 

25 Estrogen-regulated markers may be identified in such 

cells using any technique that is suitable for detecting 
differential gene expression. There are numerous such 
techniques that are well known in the art. One convenient 
screening method involves the use of commercially available 

30 chips containing large numbers of human sequences (e.gr., the 
UniGEM technology, available from Incyte) . Briefly, polyA 
RNA is isolated from cells treated with estrogen and from 
cells treated with a vehicle control. The RNA is used to 
generate labeled cDNA probes, which are then hybridized to 

3 5 the immobilized human sequences. In a preferred embodiment, 
sequences that are expressed at a level that is at least two 



fold greater in the presence of estrogen than in the absence 
of estrogen are considered estrogen-regulated markers, 
although sequences with any detectable increase or decrease 
may be used as markers. The estrogen-regulated marker may 
5 be, for example, partially or fully encoded by one of the 

nucleic acids sequences set forth in SEQ ID NOs : 1-19, or by 
a fragment of one of the nucleic acid sequences set forth in 
SEQ ID NOs: 1-19, or by a nucleic acid that hybridizes under 
stringent conditions to one of the nucleic acid sequences 

10 set forth in SEQ ID NOs : 1-19. Within the context of the 
present invention, the sequences listed in Table I (see 
Example 2) have been found to encode estrogen- regulated 
markers. Preferred estrogen-regulated markers include, for 
example, cyclin Al , prostacyclin (PGI2) synthase, connexin 

15 37, diastrophic dysplasia sulfate transporter, podocalyxin, 
and clusterin. Prefered estrogen- regulated markers are 
selected based on biological relevence, fold activation and 
other factors. It will be apparent, however, that other 
estrogen-regulated markers may be identified using the 

2 0 methods provided herein, and may be used as markers within 
the screens discussed below. 

ASSAYS FOR THE IDENTIFICATION OF THERAPEUTIC AGENTS 

The estrogen-regulated markers provided herein may be 
used within assays designed to identify therapeutic agents 
with estrogenic and/or antiestrogenic (i.e., estrogen 
antagonistic) activity. Candidate compounds for use within 
the assays provided herein may be derived from any source, 
including small molecule combinatorial libraries. Certain 
candidate compounds are estrogen-related. As used herein, 
an "estrogen-related compound" is any compound that binds 
detectably to an estrogen receptor. Such binding may be 
detected using any assay known in the art, including direct 
binding assays and functional assays (e.gr., assays for the 
ability to stimulate VEGF expression) . 



Within certain assays, estrogen-regulated markers, ERM 
related proteins, or nucleic acids encoding ERMs or ERM 
related proteins may be used to identify selective estrogen 
receptor modulators (referred to herein as SERMs) . Known 
5 SERMs include Raloxifene (Evista) ; Tamoxifen (Nolvadex) ; 
Toremif ene (Fareston) ; Idoxifene; Droloxifene; 
Tibolone (Livial) ; Levormeloxif ene ; GW-5638; CP-336,156 
(Lasofoxifene) ; ICI-182,780 (Faslodex) ; EM-800 (EM-652) ; 
TAT- 59 (Miproxif ene) ; Compound 7b (cyclopropyl 

10 antiestrogen) ; Analog II (diaryl cyclopropane) ; ERA-923; TSE- 
424; SR 16234, SR16287; LY353381; HMR-3339; EM-800. In 
preferred embodiments, the methods of the invention do not 
include the administration, identification or screening for 
any of these recited, known SERMs. In preferred 

15 embodiments, the compositions of the invention do not 
include the presence of a recited, known SERM. 

Within such assays, the effect of a candidate modulator 
on an expression profile of one or more estrogen-regulated 
markers or ERM related proteins or nucleic acid encoding 

2 0 therefor is determined. In a preferred embodiment, the 

detected marker is or comprises a fragment of one or more of 
the following: ERM- 1 , ERM-2 , ERM- 3 , ERM-4 , ERM- 5 , ERM-6 , 
ERM- 7 , ERM -8 , ERM- 9 , ERM- 10 , ERM- 11 , ERM- 12 , ERM- 13 , ERM- 14 ,. 
ERM-15, ERM-16, ERM- 17, ERM- 18 , ERM- 19, ERM-20, ERM-21, ERM- 

25 22, ERM-23, ERM- 24 , ERM-2 5 , ERM-26, ERM-2 7 , ERM-28, ERM-29, 
ERM-30, ERM-31, ERM- 3 2 , ERM- 3 3 , ERM-34, ERM-35, ERM-3 6, ERM- 
37, ERM-38, ERM- 39, ERM-4 0 , ERM-4 1, ERM- 4 2 , ERM- 4 3 , ERM- 44 , 
ERM-4 5 , ERM-46, ERM-47, ERM-4 8 , ERM-4 9 , ERM-50, ERM- 51, ERM- 
52, ERM-53, ERM- 54 , ERM- 55, ERM-56, ERM-57, ERM- 5 8 , ERM- 60 , 

30 ERM-61, ERM- 62 , ERM- 63 , ERM-64, ERM-65, ERM-66, ERM-67, ERM- 
68, ERM- 69, ERM- 70 , ERM-71, ERM-72, ERM- 73 , ERM- 74 and ERM- 
75. In another preferred embodiment, one or more estrogen- 
regulated markers are detected by hybridization under highly 
stringent conditions of one or more of SEQ ID NOs : 1-19 to 

35 mRNA from a cell of interest. An expression profile may be 
obtained using two or more cell types of interest. Briefly, 



the effect of estrogen on expression of an estrogen- 
regulated marker or ERM related protein, or nucleic acid 
coding therefor is determined in the selected cell types. 
The effect of a comparable amount of the candidate modulator 
5 is then determined and, for each cell type, the effect of 
the candidate modulator is compared to the effect of the 
estrogen . 

More specifically, cell-types to be used within an 
expression profile are selected to permit the identification 

10 of SERMs that have estrogenic activity in tissues where such 
activity is beneficial {e.g., bone, central nervous system, 
urogenital tissue and cardiovascular tissue) , and that have 
no activity or ant i -estrogenic activity in tissues where 
estrogenic activity is potentially harmful (e.g., breast, 

15 ovarian and endometrial tissue) . Preferred cell types for 
use in obtaining an expression profile include commercially 
available cells such as mammary gland adenocarcinoma cells, 
e.g., MCF-7 cells; mammary gland primary ductal carcinoma, 
e.g., HCC70, HCC1395, or HCC1500 cells, uterine sarcoma 

20 cells, e.g., MES-SA cells, cervical squamous cell carcinoma 
cells, e.g., SW756 cells, or human umbilical cord vascular 
enthothelial cells, e.g., HUVECs cells. Other cell lines 
which have been developed include endometrial epithelial, 
endometrial stromal, and osteoblast cells. Cells of nearly 

2 5 any type can be produced by, for example, obtaining early 
passage primary cells from commercial sources (e.g. Cell 
Applications, Inc., San Diego, CA) and infecting them with 
the immortalizing retrovirus LXSN16E6E7. Immortalized cells 
are selected by G418 treatment and determined to be growth 

30 extended relative to uninfected cells. 

An expression profile of one or more ERMs, ERM related 
proteins or nucleic acids coding therefor may be obtained in 
a variety of ways, including measurement of the level of 
ERMs or ERM related proteins, or mRNA encoding the ERMs or 

35 ERM related proteins, in the selected cells. For example, 
antibodies specific for the protein of interest may be used 



to measure the level of expression of a protein of interest 
in, for example, an ELISA or Western blot. For several of 
the ERMs identified herein, antibodies are commercially 
available, and methods of making antibodies that recognize 
5 a protein are known to those of skill in the art. 

Polycolonal or monoclonal antibodies may be utilized. 

Alternatively, the endogenous promoter for the gene 
encoding the ERM or ERM related protein may be isolated and 
operably linked to a reporter gene. Reporter genes include 

10 the luc if erase gene, CAT, or any other reporter known to 
those of skill in the art. Cells transformed with such a 
construct may be used to generate an expression profile 
based on the level of reporter protein or mRNA in the cells. 
Assays for qualitatively and quantitatively determining 

15 mRNA and protein levels in a sample are well known in the 
art and amply described in the patent and scientific 
literature. For example, expression may be determined at 
the transcriptional level (i.e., using any of a variety of 
well known hybridization or amplification techniques to 

20 assess the level of estrogen-regulated marker mRNA) or at 

the translational level (i.e., using a compound, such as an 
antibody, that binds to estrogen-regulated marker to assess 
the level of such protein synthesized in any standard assay 
format, such as ELISA). Representative methods are also 

2 5 provided herein. 

After obtaining an expression profile, the effect of 
estrogen on the expression profile is generally determined. 
Any active form of estrogen may be used, such as 17-(i 
estradiol or diethylstilbesterol (DES) , generally in an 

3 0 amount ranging from 1 nM to 100 nM (preferably about 10 nM) 

for about 1-24 hours Following exposure of the cells to 
the estrogen, the expression profile for the estrogen- 
regulated marker is again determined, and the effect of 
estrogen on marker expression in each cell type is obtained. 
35 Optionally, the effect of estrogen on ERMs or ERM related 

proteins can be determined after exposure of the test cells 



to an effector molecule or protein other than an estrogenic 
one, such as TNF or IL-1. Thus, in one embodiment, cells 
are first contacted with a cytokine or other molecule with 
biological effect, such as TNF or IL-1, then contacted with 
5 an estrogenic molecule, such as DES . Expression profiles 
can be determined prior to contact with any effector 
molecule, after contact with the non-estrogenic effector 
molecule, and after contact with the estrogenic molecule. 

Candidate SERMs may be evaluated in a similar manner. 

10 The cell types used in the estrogen- induced expression 

profile are contacted with the candidate SERM (generally in 
an amount ranging from 1 nM to 1000 nM, preferably 1 nM to 
100 nM) for a period of time that is substantially similar 
to that used for exposure to estrogen (i.e., within about 

15 10% of the estrogen contact time) . Following exposure of 

the cells to the candidate SERM, the expression profile for 
the estrogen-regulated marker is determined, and the effect 
of the candidate SERM on marker expression in each cell type 
is obtained. Finally, for each cell type, the effect of the 

2 0 candidate SERM is compared to the effect of estrogen and 

other SERMS (i.e. tamoxifen, raloxifene and ICI 164,384) on 
marker expression. These other SERMs act by binding to the 
ligand binding site of the estrogen receptor. This is not a 
necessary criteria for a SERM. Any compound which alters 
25 activity of the estrogen receptor in the desired tissue (s) 
(i.e. increasing activity in tissues which estrogenic 
activity is desirable and decreasing where estrogenic 
activity is detrimental) would qualify. This would include 
compounds functioning by altering formation of the estrogen 

3 0 receptor complex (i.e. interactions between the estrogen 

receptor and coactivators and corepressors) . A SERM is any 
candidate for which (a) the effect in test cells where 
estrogenic activity is desired is not substantially lower 
than the effect of estrogen (i.e., activity is greater, the 
35 same or at least 90%, 80%, 70%, 50%, 40%, or 30% of the 

activity of estrogen) and (b) the effect in test cells where 



estrogenic activity is undesirable is minimal (i.e., no more 
than 10%, 20%, or 30% of the activity of estrogen) , 
undetectable or antiestrogenic. 

For certain uses, a SERM preferably has antiestrogenic 
5 activity in certain cells (e.g., breast and endometrial). 
Antiestrogenic activity may be detected using the assays 
described above, with the modification that the level (s) of 
estrogen-regulated marker (s) are detected in the presence of 
both estrogen and candidate SERM, and compared to the levels 

10 detected in the presence of estrogen and the absence of 
candidate SERM. A SERM has antiestrogenic activity if 
contact with the SERM results in a decrease in estrogen- 
induced expression of the estrogen-regulated marker (s). 
Preferably, the decrease is at least 50% of the level of 

15 induction by estrogen. 

Other assays may be used to evaluate the cardiovascular 
effect of a candidate agent. Briefly, the effect of a 
candidate agent on estrogen-regulated marker expression is 
determined in estrogen receptor-enhanced vascular 

20 endothelial cells. Estrogen-regulated marker expression may 
be evaluated in such cells by determining the level of 
marker mRNA or protein present, as discussed above. 
Alternatively, as discussed above, cells for use in such 
assays may be transfected with a plasmid comprising an 

2 5 estrogen- regulated marker promoter operably linked to a 

reporter gene. In such transfected cells, marker expression 
may be determined by evaluating reporter gene expression 
(i.e., measuring the level of reporter gene mRNA, reporter 
protein or reporter activity), as described above. Such 

3 0 screens may be used, for example, to identify estrogen- 

related compounds that have minimal cardiovascular effects, 
or to identify drugs for preventing and/or treating 
cardiovascular disease. 

More specifically, to evaluate the cardiovascular 
35 effect of a candidate agent, estrogen-regulated marker 

expression is generally evaluated by any assay described 



herein within an estrogen receptor-enhanced vascular 
endothelial cell in the presence of the compound. Such 
expression is then compared to expression of the marker 
observed in the absence of candidate agent. Within certain 
5 preferred embodiments, estrogen-regulated marker expression 
is determined using real-time PCR analyses performed on RNA 
obtained from cells that were exposed to one or more 
candidate agents, as well as cells that were not exposed to 
such a compound. A compound that enhances estrogen- 
ic regulated marker expression may have a cardioprotective 

effect similar to that of estrogen. Such compounds may be 
of therapeutic value in treating certain cardiovascular 
diseases, or may be used prophylactically to inhibit the 
development of such diseases. 
15 Another preferred assay is an RNase protection assay 

(RPA) , which can be used to examine a panel of genes 
identified as estrogen-regulated in VE-ERa cells. The RPA 
is described in Gilman, 1993, In Current Protocols in 
Molecular Biology, Vol. 1 (Ausubel, F.M., R. Brent, R,E. 
20 Kingston, D.D, Moore, J.G. Seidman, J. A. Smith and K. Stuhl, 
eds.) pp. 4.7.1-4.7.8, John Wiley and Sons, Inc., New York, 
incorporated herein by reference in its entirety. Within 
this assay, which may be performed according to standard 
protocols, multiple probes can be used in a single reaction 
25 that requires as little as 1 /xg of total RNA, resulting in 
high information content and a molecule profile for each 
compound on ERot regulated gene expression in vascular 
endothelial cells. The RPA assay is very sensitive and is 
therefore able to detect low copy messages and distinguish 
3 0 low fold modulations typical of estrogen. This approach has 
a number of advantages over a reporter assay. The 
endogenous genes in the multigene assay will contain all 
endogenous gene regulatory elements and will not be 
restricted to activation occurring through an ERE element. 
35 Since VE-ERa cells are human cells that have been 



characterized and determined to maintain endothelial 
specific phenotypic markers and functions, tissue selective 
ER regulation can be investigated. 

Within further aspects, methods are provided for 
5 identifying estrogen regulated markers, comprising the steps 
of: (a) contacting a cell that expresses an estrogen 
receptor with an estrogenic compound for a period of time ; 
(b) isolating RNA from said cells; (c) determining levels of 
expression of one or more RNA molecules; and (d) comparing 

10 the levels of expression with levels of expression of the 
RNA molecules in cells not contacted with estrogen, and 
therefrom determining whether the one or more RNA molecules 
are estrogen regulated markers. The level of expression of 
RNA molecules can be determined by, for example, RNA 

15 fingerprinting or by probing arrays, such as those provided 
by the Incyte Genomics UniGEM technology. The cell may be 
one that normally expresses an estrogen receptor, or one 
that has been genetically engineered to express an estrogen 
receptor . 

20 

ERM PEPTIDES AND NUCLEIC ACIDS 

The invention features nucleic acid molecules which are 
at least 30%, 35%, 40%, 45%, 50%, 55%, 65%, 75%, 85%, 95%, 
or 98% identical to the nucleotide sequence of SEQ ID NO: 1; 

25 SEQ ID NO: 2; SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ 
ID NO: 6; SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID 
NO: 10; SEQ ID NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID 
NO: 14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID 
NO: 18; SEQ ID NO: 19; or a sequence referred to by 

30 accession number for any ERM listed in table I, or a 

complement thereof. Preferably, the nucleic acid molecules 
are at least 30%, 35%, 40%, 45%, 50%, 55%, 65%, 75%, 85%, 
95%, or 98% identical to at least or about 50, 100, 150, 200 
or 250 consecutive nucleic acids of SEQ ID NO: 1; SEQ ID NO: 

35 2; SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ ID NO: 6; 

SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; SEQ 



ID NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 14; SEQ 
ID NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID NO: 18; SEQ 
ID NO: 19; or a sequence referred to by accession number for 
any ERM listed in table I, or a complement thereof. 
5 The invention features isolated polypeptides or 

proteins which are encoded by a nucleic acid molecule having 
or comprising a nucleotide sequence that is at least about 
30%, preferably 35%, 40%, 45%, 50%, 55%, 65%, 75%, 85%, 95%, 
or 98% identical to the nucleotide sequence of SEQ ID NO: 1; 

10 SEQ ID NO: 2; SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ 
ID NO: 6; SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID 
NO: 10; SEQ ID NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID 
NO: 14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID 
NO: 18; SEQ ID NO: 19; or a sequence referred to by 

15 accession number for any ERM listed in table I, a complement 
thereof, or the non-coding strand thereof, wherein the 
polypeptides or proteins also exhibit at least one 
structural and/or functional feature of a polypeptide of the 
invention. 

20 The invention features nucleic acid molecules of at 

least 50, 100, 150, 200, or 250 nucleotides of the 
nucleotide sequence of SEQ ID NO: 1; SEQ ID NO: 2; SEQ ID 
NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ ID NO: 6; SEQ ID NO: 
7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; SEQ ID NO: 11; 

25 SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 14; SEQ ID NO: 15; 
SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID NO: 18; SEQ ID NO : 19; 
or a sequence referred to by accession number for any ERM 
listed in table I, or a complement thereof. 

The invention features isolated polypeptides or 

3 0 proteins which are encoded by a nucleic acid molecule having 
or comprising a nucleotide sequence that is at least about 
50, 100, 150, 200, or 250 or more contiguous nucleotides 
identical to the nucleic acid sequence of SEQ ID NO: 1; SEQ 
ID NO: 2; SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ ID 

35 NO: 6; SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 
10; SEQ ID NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 



14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID NO: 
18; SEQ ID NO: 19; or a sequence referred to by accession 
number for any ERM listed in table I, or a complement 
thereof. Preferably, the polypeptides or proteins also 
5 exhibit at least one structural and/or functional feature of 
a polypeptide of the invention. 

The invention also features nucleic acid molecules 
which include a nucleotide sequence encoding a protein 
having an amino acid sequence that is at least 25%, 30%, 

10 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% 

identical to the amino acid sequence of a protein coded for 
by any nucleic acid sequence recited or referred to by 
accession number in Table I . 

In preferred embodiments, the nucleic acid molecules 

15 have the nucleotide sequence of SEQ ID NO: 1; SEQ ID NO: 2; 
SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ ID NO: 6; SEQ 
ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; SEQ ID 
NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 14; SEQ ID 
NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID NO: 18; SEQ ID 

20 NO: 19; or a sequence referred to by accession number for 
any ERM listed in table I. 

Also within the invention are nucleic acid molecules 
which encode a polypeptide having the amino acid sequence of 
a protein coded for by any nucleic acid sequence recited or 

25 referred to by accession number in Table I, or a fragment 

thereof including at least 10, 15, 20, 25, 30, 50, 75 or 80 
contiguous amino acids thereof. 

The invention also features nucleic acid molecules 
which encode a polypeptide fragment of at least 10, 15, 25, 

30 30, 50, 75, 80 or more contiguous amino acids of a protein 
coded for by any nucleic acid sequence recited or referred 
to by accession number in Table I. Preferably, the fragment 
exhibits at least one structural and/or functional feature 
of a polypeptide of the invention. 

35 The invention includes nucleic acid molecules which 

encode a naturally occurring allelic variant of a 



polypeptide comprising the amino acid sequence of a protein 
coded for by any nucleic acid sequence recited or referred 
to by accession number in Table I . 

Also within the invention are isolated polypeptides or 
5 proteins having an amino acid sequence that is at least 

about 30%, preferably 40%, 45%, 55%, 65%, 75%, 85%, 95%, or 
98% identical to the amino acid sequence of a protein coded 
for by any nucleic acid sequence recited or referred to by 
accession number in Table I . 

10 Also within the invention are isolated polypeptides or 

proteins which are encoded by a nucleic acid molecule having 
a nucleotide sequence that is at least about 30%, preferably 
35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% 
identical to the nucleic acid sequence encoding a protein 

15 coded for by any nucleic acid sequence recited or referred 

to by accession number in Table I, and isolated polypeptides 
or proteins which are encoded by a nucleic acid molecule 
having a nucleotide sequence which hybridizes under 
stringent hybridization conditions to a nucleic acid 

2 0 molecule having the nucleotide sequence of SEQ ID NO: 1; SEQ 

ID NO: 2; SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ ID 
NO: 6; SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 
10; SEQ ID NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 
14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID NO: 
25 18; SEQ ID NO: 19; or a sequence referred to by accession 
number for any ERM listed in table I, or a complement 
thereof. Preferably, the polypeptides or proteins also 
exhibit at least one structural and/or functional feature of 
a polypeptide of the invention. 

3 0 Also within the invention are polypeptides which are 

naturally occurring allelic variants of a polypeptide that 
includes the amino acid sequence of a protein coded for by 
any nucleic acid sequence recited or referred to by 
accession number in Table I, wherein the polypeptide is 
3 5 encoded by a nucleic acid molecule which hybridizes to a 

nucleic acid molecule having the sequence of SEQ ID NO: 1; 



SEQ ID NO: 2; SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ 
ID NO: 6; SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID 
NO: 10; SEQ ID NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID 
NO: 14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID 
5 NO: 18; SEQ ID NO: 19; or a sequence referred to by 
accession number for any ERM listed in table I, or a 
complement thereof under stringent conditions. 

The invention also features nucleic acid molecules that 
hybridize under stringent conditions to a nucleic acid 

10 molecule having the nucleotide sequence of SEQ ID NO: 1; SEQ 
ID NO: 2; SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ ID 
NO: 6; SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 
10; SEQ ID NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 
14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID NO: 

15 18; SEQ ID NO: 19; or a sequence referred to by accession 
number for any ERM listed in table I, or a complement 
thereof. In one embodiment, the nucleic acid molecules are 
at least 50, 100, 150, 200, 272, 300, 400, 500, 550, 600, 
650, 700, 750, 800, 1000, 1100, 1200, 1300, 1400, 1500, 

20 1600, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, 
2500, 2600 or 2700 nucleotides in length and hybridize under 
stringent conditions to a nucleic acid molecule comprising 
the nucleotide sequence of SEQ ID NO: 1; SEQ ID NO: 2; SEQ 
ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ ID NO: 6; SEQ ID 

25 NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; SEQ ID NO: 
11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 14; SEQ ID NO: 
15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID NO: 18; SEQ ID NO: 
19; or a sequence referred to by accession number for any 
ERM listed in table I, or a complement thereof. 

3 0 The invention also features nucleic acid molecules at 

least 15, preferably 50, 75, 100, 150, 200, 250, 300, 350, 
400, 500, 600 or more nucleotides in length and hybridize 
under stringent conditions to a nucleic acid molecule 
comprising the nucleotide sequence of SEQ ID NO: 1; SEQ ID 

35 NO: 2; SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ ID NO: 
6; SEQ ID NO : 7 ; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO : 10; 



SEQ ID NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 14; 
SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID NO: 18; 
SEQ ID NO: 19; or a sequence referred to by accession number 
for any ERM listed in table I, or a complement thereof. 
5 Preferably, such nucleic acid molecules encode polypeptides 
or proteins that exhibit at least one structural and/or 
functional feature of a polypeptide of the invention. 

In one embodiment, the invention provides an isolated 
nucleic acid molecule which is antisense to the coding 

10 strand of a nucleic acid of the invention. 

As used herein, the term "nucleic acid molecule" is 
intended to include DNA molecules (e.gr., cDNA or genomic 
DNA) and RNA molecules (e.gr., mRNA) and analogs of the DNA 
or RNA generated using nucleotide analogs. The nucleic acid 

15 molecule can be single -stranded or double-stranded, but 
preferably is double -stranded DNA. 

An "isolated" nucleic acid molecule is one which is 
separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid molecule. 

20 Preferably, an "isolated" nucleic acid molecule is free of 
sequences (preferably protein encoding sequences) which 
naturally flank the nucleic acid (i.e., sequences located at 
the 5 1 and 3 1 ends of the nucleic acid) in the genomic DNA 
of the organism from which the nucleic acid is derived. For 

25 example, in various embodiments, the isolated nucleic acid 

molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 
1 kB, 0.5 kB or 0.1 kB of nucleotide sequences which 
naturally flank the nucleic acid molecule in genomic DNA of 
the cell from which the nucleic acid is derived. Moreover, 

30 an "isolated" nucleic acid molecule, such as a cDNA 

molecule, can be substantially free of other cellular 
material, or culture medium when produced by recombinant 
techniques, or substantially free of chemical precursors or 
other chemicals when chemically synthesized. As used 

35 herein, the term "isolated" when referring to a nucleic acid 
molecule does not include an isolated chromosome. 



In instances wherein the nucleic acid molecule is a 
cDNA or RNA, e.gr., mRNA, molecule, such molecules can 
include a poly A "tail", or, alternatively, can lack such a 
3 ' tail . Although cDNA or RNA nucleotide sequences may be 
5 depicted herein with such tail sequences, it is to be 

understood that cDNA nucleic acid molecules of the invention 
are also intended to include such sequences lacking the 
depicted poly A tails. 

A nucleic acid molecule of the present invention, e.g., 

10 a nucleic acid molecule having the nucleotide sequence of 

SEQ ID NO: 1; SEQ ID NO: 2; SEQ ID NO: 3; SEQ ID NO: 4; SEQ 
ID NO: 5; SEQ ID NO: 6; SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID 
NO: 9; SEQ ID NO: 10; SEQ ID NO: 11; SEQ ID NO: 12; SEQ ID 
NO: 13; SEQ ID NO: 14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID 

15 NO: 17; SEQ ID NO: 18; SEQ ID NO: 19; or a sequence referred 
to by accession number for any ERM listed in table I or a 
complement thereof, can be isolated using standard molecular 
biology techniques and the sequence information provided 
herein. Using all or a portion of the nucleic acid 

2 0 sequences of SEQ ID NO: 1; SEQ ID NO: 2; SEQ ID NO: 3; SEQ 
ID NO: 4; SEQ ID NO: 5; SEQ ID NO: 6; SEQ ID NO: 7; SEQ ID 
NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; SEQ ID NO: 11; SEQ ID 
NO: 12; SEQ ID NO: 13; SEQ ID NO: 14; SEQ ID NO: 15; SEQ ID 
NO: 16; SEQ ID NO: 17; SEQ ID NO: 18; SEQ ID NO: 19; or a 

25 sequence referred to by accession number for any ERM listed 
in table I as a hybridization probe, nucleic acid molecules 
of the invention can be isolated using standard 
hybridization and cloning techniques (e.g., as described in 
Sambrook et al . , eds . , Molecular Cloning: A Laboratory 

30 Manual, 2nd ed. , Cold Spring Harbor Laboratory, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989) . 

A nucleic acid molecule of the invention can be 
amplified using cDNA, mRNA or genomic DNA as a template and 
appropriate oligonucleotide primers according to standard 

35 PCR amplification techniques. The nucleic acid so amplified 
can be cloned into an appropriate vector and characterized 



by DNA sequence analysis. Furthermore, oligonucleotides 
corresponding to all or a portion of a nucleic acid molecule 
of the invention can be prepared by standard synthetic 
techniques, e.g., using an automated DNA synthesizer. 
5 As used herein, the term "hybridizes under stringent 

conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide sequences 
at least 60% (65%, 70%, preferably 75%) identical to each 
other typically remain hybridized to each other. Such 

10 stringent conditions are known to those skilled in the art 
and can be found in Current Protocols in Molecular Biology, 
John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, 
non-limiting example of stringent hybridization conditions 
are hybridization in 6X sodium chloride/sodium citrate (SSC) 

15 at about 45° C, followed by one or more washes in 0 . 2 X SSC, 
0.1% SDS at 50-65° C. Preferably, an isolated nucleic acid 
molecule of the invention that hybridizes under stringent 
conditions to the sequence of SEQ ID NO: 1; SEQ ID NO: 2 ; 
SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID NO: 5; SEQ ID NO: 6; SEQ 

20 ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; SEQ ID 
NO: 11; SEQ ID NO: 12; SEQ ID NO: 13; SEQ ID NO: 14; SEQ ID 
NO: 15; SEQ ID NO: 16; SEQ ID NO: 17; SEQ ID NO: 18; SEQ ID 
NO: 19; or a sequence referred to by accession number for 
any ERM listed in table I or a complement thereof, 

25 corresponds to a naturally-occurring nucleic acid molecule. 
As used herein, a "naturally-occurring" nucleic acid 
molecule refers to an RNA or DNA molecule having a 
nucleotide sequence that occurs in nature (e.g., encodes a 
natural protein) . 

30 In addition to naturally-occurring allelic variants of 

a nucleic acid molecule of the invention sequence that may 
exist in the population, the skilled artisan will further 
appreciate that changes can be introduced by mutation 
thereby leading to changes in the amino acid sequence of the 

35 encoded protein, without altering the biological activity of 
the protein. For example, one can make nucleotide 



substitutions leading to amino acid substitutions at 
"non-essential" amino acid residues. A "non-essential" 
amino acid residue is a residue that can be altered from the 
wild-type sequence without altering the biological activity, 
5 whereas an "essential" amino acid residue is required for 
biological activity. For example, amino acid residues that 
are not conserved or only semi -conserved among homologues of 
various species may be non-essential for activity and thus 
would be likely targets for alteration. Alternatively, 
10 amino acid residues that are conserved among the homologues 
of various species (e.gr., mouse and human) may be essential 
for activity and thus would not be likely targets for 
alteration. 

Another aspect of the invention provides vectors, e.gr., 

15 recombinant expression vectors, comprising a nucleic acid 
molecule of the invention. In another embodiment, the 
invention provides host cells containing such* a vector or 
engineered to contain and/or express a nucleic acid molecule 
of the invention. The invention also provides methods for 

2 0 producing a polypeptide of the invention by culturing, in a 
suitable medium, a host cell of the invention such that a 
polypeptide of the invention is produced. As used herein, 
the term "vector" refers to a nucleic acid molecule capable 
of transporting another nucleic acid to which it has been 

25 linked. - One type of vector is a "plasmid" , which refers to 
a circular double stranded DNA loop into which additional 
DNA segments can be ligated. Another type of vector is a 
viral vector, wherein additional DNA segments can be ligated 
into the viral genome. Certain vectors are capable of 

30 autonomous replication in a host cell into which they are 
introduced (e.gr., bacterial vectors having a bacterial 
origin of replication and episomal mammalian vectors) . 
Other vectors (e.gr., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon introduction 

35 into the host cell, and thereby are replicated along with 
the host genome. Moreover, certain vectors, expression 



vectors, are capable of directing the expression of genes to 
which they are operably linked. In general, expression 
vectors of utility in recombinant DNA techniques are often 
in the form of plasmids (vectors) . However, the invention 
5 is intended to include such other forms of expression 

vectors, such as viral vectors (^e.gr., replication defective 
retroviruses, adenoviruses and adeno-associated viruses), 
which serve equivalent functions. 

The recombinant expression vectors of the invention 

10 comprise a nucleic acid of the invention in a form suitable 
for expression of the nucleic acid in a host cell. This 
means that the recombinant expression vectors include one or 
more regulatory sequences, selected on the basis of the host 
cells to be used for expression, which is operably linked to 

15 the nucleic acid sequence to be expressed. Within a 

recombinant expression vector, "operably linked" is intended 
to mean that the nucleotide sequence of interest is linked 
to the regulatory sequence (s) in a manner which allows for 
expression of the nucleotide sequence (e.gr., in an in vitro 

20 transcription/translation system or in a host cell when the 
vector is introduced into the host cell) . The term 
"regulatory sequence" is intended to include promoters, 
enhancers and other expression control elements (e.gr-, 
polyadenylation signals) . Such regulatory sequences are 

25 described, for example, in Goeddel, Gene Expression 

Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA (1990) . Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in 
many types of host cell and those which direct expression of 

30 the nucleotide sequence only in certain host cells (e.gr., 
tissue-specific regulatory sequences) . It will be 
appreciated by those skilled in the art that the design of 
the expression vector can depend on such factors as the 
choice of the host cell to be transformed, the level of 

35 expression of protein desired, etc. The expression vectors 
of the invention can be introduced into host cells to 



thereby produce proteins' or peptides, including fusion 
proteins or peptides, encoded by nucleic acids as described 
herein . 

The recombinant expression vectors of the invention can 
5 be designed for expression of a polypeptide of the invention 
in prokaryotic (e.g., E. coli ) or eukaryotic cells (e.g., 
insect cells (using baculovirus expression vectors) , yeast 
cells or mammalian cells) . Suitable host cells are 
discussed further in Goeddel, supra.. Alternatively, the 

10 recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter 
regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often 
carried out in E. coli with vectors containing constitutive 

15 or inducible promoters directing the expression of either 

fusion or non- fusion proteins. Fusion vectors add a number 
of amino acids to a protein encoded therein, usually to the 
amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase 

20 expression of recombinant protein; 2) to increase the 

solubility of the recombinant protein; and 3) to aid in the 
purification of the recombinant protein by acting as a 
ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is 

2 5 introduced at the junction of the fusion moiety and the 

recombinant protein to enable separation of the recombinant 
protein from the fusion moiety subsequent to purification of 
the fusion protein. Such enzymes, and their cognate 
recognition sequences, include Factor Xa, thrombin and 

3 0 enterokinase . Typical fusion expression vectors include 

pGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 
67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione 
S- transferase (GST) , maltose E binding protein, or protein 
3 5 A, respectively, to the target recombinant protein. 



Examples of suitable inducible non- fusion E. coli 
expression vectors include pTrc (Amann et al . , (1988) Gene 
69:301-315) and pET lid (Studier et al . , Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San 
5 Diego, California (1990) 60-89) . Target gene expression 
from the pTrc vector relies on host RNA polymerase 
transcription from a hybrid trp-lac fusion promoter. Target 
gene expression from the pET lid vector relies on 
transcription from a T7 gnlO-lac fusion promoter mediated by 
10 a coexpressed viral RNA polymerase (T7 gnl) . This viral 
polymerase is supplied by host strains BL21(DE3) or 
HMS174 (DE3 ) from a resident A prophage harboring a T7 gnl 
gene under the transcriptional control of the lacUV 5 
promoter. 

15 One strategy to maximize recombinant protein expression 

in E. coli is to express the protein in a host bacteria with 
an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, Gene Expression Technology: 
Methods in Enzymology 185, Academic Press, San Diego, 

20 California (1990) 119-128) . Another strategy is to alter 

the nucleic acid sequence of the nucleic acid to be inserted 
into an expression vector so that the individual codons for 
each amino acid are those preferentially utilized in E. coli * 
(Wada et al . (1992) Nucleic Acids Res. 20:2111-2118). Such 

25 alteration of nucleic acid sequences of the invention can be 
carried out by standard DNA synthesis techniques. 

In another embodiment, the expression vector is a yeast 
expression vector. Examples of vectors for expression in 
yeast S. cerivisae include pYepSecl (Baldari et al . (1987) 

30 EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 
30:933-943), pJRY88 (Schultz et al . (1987) Gene 54:113-123), 
pYES2 (Invitrogen Corporation, San Diego, CA) , and pPicZ 
(Invitrogen Corp, San Diego, CA) . 

Alternatively, the expression vector is a baculovirus 

35 expression vector. Baculovirus vectors available for 



expression of proteins in cultured insect cells (e.gr. # Sf 9 
cells) include the pAc series (Smith et al . (1983) Mol . Cell 
Biol. 3:2156-2165) and the pVL series (Lucklow and Summers 
(1989) Virology 170:31-39) . 
5 In yet another embodiment, a nucleic acid of the 

invention is expressed in mammalian cells using a mammalian 
expression vector. Examples of mammalian expression vectors 
include pCDM8 (Seed (1987) Nature 329:840) and pMT2PC 
(Kaufman et al . (1987) EMBO J. 6:187-195). When used in 

10 mammalian cells, the expression vector's control functions 
are often provided by viral regulatory elements. For 
example, commonly used promoters are derived from polyoma, 
Adenovirus 2, cytomegalovirus and Simian Virus 40. For 
other suitable expression systems for both prokaryotic and 

15 eukaryotic cells see chapters 16 and 17 of Sambrook et al . , 
supra . 

In another embodiment, the recombinant mammalian 
expression vector is capable of directing expression of the 
nucleic acid preferentially in a particular cell type (e.gr., 

20 tissue-specific regulatory elements are used to express the 
nucleic acid) . Tissue-specific regulatory elements are 
known in the art. Non-limiting examples of suitable 
tissue- specific promoters include the albumin promoter 
(liver-specific; Pinkert et al . (1987) Genes Dev. 

25 1:268-277), lymphoid- specif ic promoters (Calame and Eaton 

(1988) Adv. Immunol. 43:235-275), in particular promoters of 
T cell receptors (Winoto and Baltimore (1989) EMBO J. 
8 : 729-733) and immunoglobulins (Banerji et al . (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), 

30 neuron-specific promoters (e.g., the neurofilament promoter; 
Byrne and Ruddle (1989) Proc . Natl. Acad. Sci . USA 
86:5473-5477), pancreas- specif ic promoters (Edlund et al . 
(1985) Science 230:912-916), and mammary gland-specific 
promoters (e.g., milk whey promoter; U.S. Patent No. 

35 4,873,316 and European Application Publication No. 264,166). 
Developmentally-regulated promoters are also encompassed, 



for example the mouse hox promoters (Kessel and Gruss (1990) 
Science 249:374-379) and the beta-f etoprotein promoter 
(Campes and Tilghman (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant expression 
5 vector comprising a DNA molecule of the invention cloned 
into the expression vector in an antisense orientation. 
That is, the DNA molecule is operably linked to a regulatory- 
sequence in a manner which allows for expression (by 
transcription of the DNA molecule) of an RNA molecule which 

10 is antisense to the mRNA encoding a polypeptide of the 
invention. Regulatory sequences operably linked to a 
nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the 
antisense RNA molecule in a variety of cell types, for 

15 instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue 
specific or cell type specific expression of antisense RNA. 
The antisense expression vector can be in the form of a 
recombinant plasmid, phagemid or attenuated virus in which 

2 0 antisense nucleic acids are produced under the control of a 

high efficiency regulatory region, the activity of which can 
be determined by the cell type into which the vector is 
introduced. For a discussion of the regulation of gene 
expression using antisense genes see Weintraub et al . 
25 (Reviews - Trends in Genetics, Vol. 1(1) 1986). 

Another aspect of the invention pertains to host cells 
into which a recombinant expression vector of the invention 
has been introduced. The terms "host cell" and "recombinant 
host cell" are used interchangeably herein. It is 

3 0 understood that such terms refer not only to the particular 

subject cell but to the progeny or potential progeny of such 
a cell . Because certain modifications may occur in 
succeeding generations due to either mutation or 
environmental influences, such progeny may not, in fact, be 
35 identical to the parent cell, but are still included within 
the scope of the term as used herein. 



A host cell can be any prokaryotic (e.g., E. coli) or 
eukaryotic cell (e.g., insect cells, yeast or mammalian 
cells) . 

Vector DNA can be introduced into prokaryotic or 
5 eukaryotic cells via conventional transformation or 
transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to 
a variety of art -recognized techniques for introducing 
foreign nucleic acid into a host cell, including calcium 

10 phosphate or calcium chloride co-precipitation, 

DEAE-dextran-mediated transfection, lipof ection, or 
electroporation . Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al . 
{supra), and other laboratory manuals. 

15 Another aspect of this invention features isolated or 

recombinant proteins and polypeptides of the invention. 
Preferred proteins and polypeptides possess at least one 
biological activity possessed by the corresponding 
naturally-occurring human polypeptide. An activity, a 

20 biological activity, or a functional activity of a 

polypeptide or nucleic acid of the invention refers to an 
activity exerted by a protein, polypeptide or nucleic acid 
molecule of the invention on a responsive cell as determined 
in vivo or in vitro, according to standard techniques. Such 

25 activities can be a direct activity, such as an association 
with or an enzymatic activity on a second protein, or an 
indirect activity, such as a cellular signaling activity 
mediated by interaction of the protein with a second 
protein. 

30 An "isolated" or "purified" protein or biologically 

active portion thereof is substantially free of cellular 
material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or 
substantially free of chemical precursors or other^ chemicals 

35 when chemically synthesized. The language "substantially 

free of cellular material" includes preparations of protein 



in which the protein is separated from cellular components 
of the cells from which it is isolated or recombinantly 
produced. Thus, protein that is substantially free of 
cellular material includes preparations of protein having 
5 less than about 30%, 20%, 10%, or 5% (by dry weight) of 
heterologous protein (also referred to herein as a 
"contaminating protein"). When the protein or biologically 
active portion thereof is recombinantly produced, it is also 
preferably substantially free of culture medium, i.e., 

10 culture medium represents less than about 20%, 10%, or 5% of 
the volume of the protein preparation. When the protein is 
produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other 
chemicals, i.e., it is separated from chemical precursors or 

15 other chemicals which are involved in the synthesis of the 
protein. Accordingly such preparations of the protein have 
less than about 30%, 20%, 10%, 5% (by dry weight) of 
chemical precursors or compounds other than the polypeptide 
of interest . 

2 0 To determine the percent identity of two amino acid 

sequences or of two nucleic acids, the sequences are aligned 
for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic 
acid, sequence for optimal alignment with a second amino or 

2 5 nucleic acid sequence) . The amino acid residues or 

nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position in 
the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the 

3 0 second sequence, then the molecules are identical at that 

position. The percent identity between the two sequences is 
a function of the number of identical positions shared by 
the sequences (, % identity = # of identical positions/total 
# of positions (e.g., overlapping positions) x 100) . In one 
3 5 embodiment, the two sequences are the same length. 



The determination of percent identity between two sequences 
can be accomplished using a mathematical algorithm. A 
preferred, non-limiting example of a mathematical algorithm 
utilized for the comparison of two sequences is the 
5 algorithm of Karlin and Altschul (1990) Proc . Natl. Acad. 
Sci. USA 87:2264-2268, modified as in Karlin and Altschul 
(1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an 
algorithm is incorporated into the NBLAST and XBLAST 
programs of Altschul, et al . (1990) J*. Mol . Biol. 215:403- 

10 410. BLAST nucleotide searches can be performed with the 
NBLAST program, score - 100, wordlength = 12 to obtain 
nucleotide sequences homologous to a nucleic acid molecules 
of the invention. BLAST protein searches can be performed 
with the XBLAST program, score = 50, wordlength = 3 to 

15 obtain amino acid sequences homologous to a protein 

molecules of the invention. To obtain gapped alignments for 
comparison purposes, Gapped BLAST can be utilized as 
described in Altschul et al . (1997) Nucleic Acids Res. 
25:3389-3402. Alternatively, PSI-Blast can be used to 

20 perform an iterated search which detects distant 

relationships between molecules (Id.). When utilizing 
BLAST, Gapped BLAST, and PSI-Blast programs, the default 
parameters of the respective programs (e.g., XBLAST and 
NBLAST) can be used. See http://www.ncbi/nlm.nih.gov. 

25 Another preferred, non- limiting example of a mathematical 
algorithm utilized for the comparison of sequences is the 
algorithm of Myers and Miller, CABIOS (1989) . Such an 
algorithm is incorporated into the ALIGN program (version 
2.0) which is part of the CGC sequence alignment software 

30 package. When utilizing the ALIGN program for comparing 

amino acid sequences, a PAM12 0 weight residue table, a gap 
length penalty of 12, and a gap penalty of 4 can be used. 
Additional algorithms for sequence analysis are known in the 
art and include ADVANCE and ADAM as described in Torellis 

35 and Robotti (1994) Comput. Appl . Biosci . , 10:3-5; and FASTA 
described in Pearson and Lipman (1988) Proc . Natl. Acad. 



Sci . 85:2444-8. Within FASTA, ktup is a control option that 
sets the sensitivity and speed of the search. If ktup=2 , 
similar regions in the two sequences being compared are 
found by looking at pairs of aligned residues; if ktup=l, 
5 single aligned amino acids are examined. ktup can be set to 
2 or 1 for protein sequences, or from 1 to 6 for DNA 
sequences. The default if ktup is not specified is 2 for 
proteins and 6 for DNA. For a further description of FASTA 
parameters, see 

10 http : //bioweb . pasteur . f r/docs/man/man/f asta . 1 . html#sect2 , 

the contents of which are incorporated herein by reference. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
above, with or without allowing gaps. In calculating 

15 percent identity, typically exact matches are counted. 

The invention also provides chimeric or fusion 
proteins. As used herein, a "chimeric protein" or "fusion 
protein" comprises all or part (preferably biologically 
active) of a polypeptide of the invention operably linked to 

20 a heterologous polypeptide (i.e., a polypeptide other than 
the same polypeptide of the invention) . Within the fusion 
protein, the term "operably linked" is intended to indicate 
that the polypeptide of the invention and the heterologous 
polypeptide are fused in- frame to each other. The 

2 5 heterologous polypeptide can be fused to the N- terminus or 

C- terminus of the polypeptide of the invention. 

One useful fusion protein is a GST fusion protein in 
which the polypeptide of the invention is fused to the 
C-terminus of GST sequences. Such fusion proteins can 
30 facilitate the purification of a recombinant polypeptide of 
the invention. 

In another embodiment, the fusion protein contains a 
heterologous signal sequence at its N-terminus. For 
example, the native signal sequence of a polypeptide of the 

3 5 invention can be removed and replaced with a signal sequence 

from another protein. For example, the gp6 7 secretory 



sequence of the baculovirus envelope protein can be used as 
a heterologous signal sequence (Current Protocols in 
Molecular Biology, Ausubel et al . , eds . , John Wiley & Sons, 
1992) . Other examples of eukaryotic heterologous signal 
5 sequences include the secretory sequences of melittin and 
human placental alkaline phosphatase (Stratagene; La Jolla, 
California). In yet another example, useful prokaryotic 
heterologous signal sequences include the phoA secretory 
signal (Sambrook et al . , supra) and the protein A secretory 

10 signal (Pharmacia Biotech; Piscataway, New Jersey) . 

The invention further features antibodies, such as 
monoclonal or polyclonal antibodies or fragments thereof, 
that specifically bind a polypeptide of the invention. The 
antibodies of the invention can be conjugated antibodies 

15 comprising, for example, therapeutic or diagnostic agents. 
For example, the antibodies can be conjugated to a 
therapeutic moiety such as a chemotherapeutic cytotoxin, 
e.g., a cytostatic or cytocidal agent (e.g., paclitaxol, 
cytochalasin B or diphtheria toxin) , a thrombotic or anti- 

20 angiogenic agent or a radioactive or fluorescent label. 

An isolated polypeptide of the invention, or a fragment 
thereof, can be used as an immunogen to generate antibodies 
using standard techniques for polyclonal and monoclonal 
antibody preparation. The full-length polypeptide or 

25 protein can be used or, alternatively, the invention 

provides antigenic peptide fragments for use as immunogens . 
The antigenic peptide of a protein of the invention 
comprises at least 8 (preferably 10, 15, 20, or 30) amino 
acid residues of a protein of the invention, and encompasses 

30 an epitope of the protein such that an antibody raised 

against the peptide forms a specific immune complex with the 
protein. 

Preferred epitopes encompassed by the antigenic peptide 
are regions that are located on the surface of the protein, 
35 e.gr., hydrophilic regions. 



An immunogen typically is used to prepare antibodies by 
immunizing a suitable subject, {e.g., rabbit, goat, mouse or 
other mammal) . An appropriate immunogenic preparation can 
contain, for example, recombinantly expressed or chemically 
5 synthesized polypeptide. The preparation can further 

include an adjuvant, such as Freund 1 s complete or incomplete 
adjuvant, or similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains 
to antibodies directed against a polypeptide of the 

10 invention. The term "antibody" as used herein refers to 

immunoglobulin molecules and immunologically active portions 
of immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site which specifically binds an antigen, 
such as a polypeptide of the invention. A molecule which 

15 specifically binds to a given polypeptide of the invention 
is a molecule which binds the polypeptide, but does not 
substantially bind other molecules in a sample, e.g., a 
biological sample, which naturally contains the polypeptide.. 
Examples of immunologically active portions of 

20 immunoglobulin molecules include F(ab) and F(ab f )2 fragments 
which can be generated by treating the antibody with an 
enzyme such as pepsin. The invention provides polyclonal 
and monoclonal antibodies. The term "monoclonal antibody" 
or "monoclonal antibody composition", as used herein, refers 

25 to a population of antibody molecules that contain only one 
species of an antigen binding site capable of immunoreacting 
with a particular epitope. 

Polyclonal antibodies can be prepared as described 
above by immunizing a suitable subject with a polypeptide of 

30 the invention as an immunogen. The antibody titer in the 
immunized subject can be monitored over time by standard 
techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized polypeptide. If desired, 
the antibody molecules can be isolated from the mammal 

35 (e.g., from the blood) and further purified by well-known 
techniques, such as protein A chromatography to obtain the 



IgG fraction. At an appropriate time after immunization, 
e.g., when the specific antibody titers are highest, 
antibody-producing cells can be obtained from the subject 
and used to prepare monoclonal antibodies by standard 
5 techniques, such as the hybridoma technique originally 

described by Kohler and Milstein (1975) Nature 256:495-497, 
the human B cell hybridoma technique (Kozbor et al . (1983) 
Immunol. Today 4:72), the EBV-hybridoma technique (Cole et 
al . (1985), Monoclonal Antibodies and Cancer Therapy, Alan 

10 R. Liss, Inc., pp. 77-96) or trioma techniques. The 

technology for producing hybridomas is well known (see 
generally Current Protocols in Immunology (1994) Coligan et 
al . (eds.) John Wiley & Sons, Inc., New York, NY) . 
Hybridoma cells producing a monoclonal antibody of the 

15 invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind the polypeptide of 
interest, e.g., using a standard ELI S A assay. 

Alternative to preparing monoclonal antibody- secreting 
hybridomas, a monoclonal antibody directed against a 

2 0 polypeptide of the invention can be identified and isolated 
by screening a recombinant combinatorial immunoglobulin 
library (e.g., an antibody phage display library) with the 
polypeptide of interest. Kits for generating and screening 
phage display libraries are commercially available (e.g., 

2 5 the Pharmacia Recombinant Phage Antibody System, Catalog No. 
27-9400-01; and the Stratagene Surf ZAP™ Phage Display Kit, 
Catalog No. 240612). Additionally, examples of methods and 
reagents particularly amenable for use in generating and 
screening antibody display library can be found in, for 

30 example, U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT Publication 
No. WO 92/20791; PCT Publication No. WO 92/15679; PCT 
Publication No. WO 93/01288; PCT Publication No. WO 
92/01047; PCT Publication No. WO 92/09690; PCT Publication 

35 No. WO 90/02809; Fuchs et al . (1991) Bio/Technology 

9:1370-1372; Hay et al . (1992) Hum. Antibod. Hybridomas 



3:81-85; Huse et al . (1989) Science 246:1275-1281; Griffiths 
et al. (1993) EMBOJ. 12:725-734. 

Additionally, recombinant antibodies, such as chimeric 
and humanized monoclonal antibodies, comprising both human 
5 and non-human portions, which can be made using standard 
recombinant DNA techniques, are within the scope of the 
invention. A chimeric antibody is a molecule in which 
different portions are derived from different animal 
species, such as those having a variable region derived from 

10 a murine mAb and a human immunoglobulin constant region. 

(See, e.g., Cabilly et al . , U.S. Patent No. 4,816,567; and 
Boss et al . , U.S. Patent No. 4,816397, which are 
incorporated herein by reference in their entirety.) 
Humanized antibodies are antibody molecules from non-human 

15 species having one or more complementarily determining 

regions (CDRs) from the non-human species and a framework 
region from a human immunoglobulin molecule. (See, e.g., 
Queen, U.S. Patent No. 5,585,089, which is incorporated 
herein by reference in its entirety.) Such chimeric and 

2 0 humanized monoclonal antibodies can be produced by 

recombinant DNA techniques known in the art, for example 
using methods described in PCT Publication No. WO 87/02 671; 
European Patent Application 184,187; European Patent 
Application 171,4 96; European Patent Application 173,4 94; 

25 PCT Publication No. WO 86/01533; U.S. Patent No. 4,816,567; 
European Patent Application 125,023; Better et al . (1988) 
Science 240:1041-1043; Liu et al . (1987) Proc . Natl. Acad. 
Sci. USA 84:3439-3443; Liu et al . (1987) J. Immunol. 
139:3521-3526; Sun et al. (1987) Proc. Natl. Acad. Sci. USA 

30 84:214-218; Nishimura et al . (1987) Cane. Res. 47:999-1005; 
Wood et. al . (1985) Nature 314:446-449; and Shaw et al . 
(1988) J". Natl. Cancer Inst. 80:1553-1559); Morrison (1985) 
Science 229:1202-1207; Oi et al . (1986) Bio/Techniques 
4:214; U.S. Patent 5,225,539; Jones et al . (1986) Nature 

35 321:552-525; Verhoeyan et al . (1988) Science 239:1534; and 
Beidler et al . (1988) J. Immunol. 141:4053-4060. 



In addition, the polypeptides of the invention or 
biologically active portions thereof, or antibodies of the 
invention, can be incorporated into pharmaceutical 
compositions, which optionally include pharmaceutically 
5 acceptable carriers. 

In another aspect, the present invention provides 
methods for detecting the presence, activity or expression 
of a polypeptide of the invention in a biological sample by 
contacting the biological sample with an agent capable of 
10 detecting an indicator of the presence, activity or 

expression such that the presence activity or expression of 
a polypeptide of the invention is detected in the biological 
sample. 

In another aspect, the invention provides methods for 

15 modulating activity of a polypeptide of the invention 

comprising contacting a cell with an agent that modulates 
(inhibits or stimulates) the activity or expression of a 
polypeptide of the invention such that activity or 
expression in the cell is modulated. In one embodiment, the 

20 agent is an antibody that specifically binds to a 

polypeptide of the invention. In another embodiment, the 
agent is a fragment of a polypeptide of the invention or a 
nucleic acid molecule encoding such a polypeptide fragment. 
In another embodiment, the agent modulates expression 

25 of a polypeptide of the invention by modulating 

transcription, splicing, or translation of an mRNA encoding 
a polypeptide of the invention. In yet another embodiment, 
the agent is a nucleic acid molecule having a nucleotide 
sequence that is antisense to the coding strand of an mRNA 

3 0 encoding a polypeptide of the invention. 

The present invention also provides methods to treat a 
subject having a disorder characterized by aberrant activity 
of a polypeptide of the invention or aberrant expression of 
a nucleic acid of the invention by administering an agent 

35 which is a modulator of the activity of a polypeptide of the 
invention or a modulator of the expression of a nucleic acid 



of the invention to the subject. In one embodiment, the 
modulator is a protein of the invention. In another 
embodiment, the modulator is a nucleic acid of the 
invention- In other embodiments, the modulator is a 
5 peptide, peptidomimetic , or other small molecule. 

The present invention also provides diagnostic assays 
for identifying the presence or absence of a genetic lesion 
or mutation characterized by at least one of: (i) aberrant 
modification or mutation of a gene encoding a polypeptide of 

10 the invention, (ii) mis-regulation of a gene encoding a 
polypeptide of the invention, and (iii) aberrant post- 
translational modification of the invention wherein a wild- 
type form of the gene encodes a protein having the activity 
of the polypeptide of the invention. 

15 In another aspect, the invention provides a method for 

identifying a compound that binds to or modulates the 
activity of a polypeptide of the invention. In general, 
such methods entail measuring a biological activity of the 
polypeptide in the presence and absence of a test compound 

2 0 and identifying those compounds which alter the activity of 
the polypeptide. 

The invention also features methods for identifying a 
compound which modulates the expression of a polypeptide or 
nucleic acid of the invention by measuring the expression of 

25 the polypeptide or nucleic acid in the presence and absence 
of the compound. 

For all nucleic acids of the invention, in a preferred 
embodiment, the nucleic acids of the invention are any or 
all those referred to above, with the proviso that the 

30 sequence of any given nucleic acid of the invention does not 
comprise or is not 100% identical to the sequence provided 
in table I, or any sequence referred to by accession number 
or name in table I . 



PHARMACEUTICAL COMPOSITIONS 

The SERMs identified by the methods of the invention, 
nucleic acid molecules, polypeptides, and antibodies (also 
referred to herein as "active compounds") of the invention 
5 can be incorporated into pharmaceutical compositions 

suitable for administration. Such compositions typically 
comprise the SERM, nucleic acid molecule, protein, or 
antibody and a pharmaceutically acceptable carrier. As used 
herein the language "pharmaceutically acceptable carrier" is 

10 intended to include any and all solvents, dispersion media, 
coatings, antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well known 

15 in the art. Except insofar as any conventional media or 

agent is incompatible with the active compound, use thereof, 
in the compositions is contemplated. Supplementary active 
compounds can also be incorporated into the compositions. 
The invention includes methods for preparing 

2 0 pharmaceutical compositions for modulating the expression or 

activity of a polypeptide or nucleic acid of the invention. 
Such methods comprise formulating a pharmaceutically 
acceptable carrier with an agent which modulates expression 
or activity of a polypeptide or nucleic acid of the 
25 invention. Such compositions can further include additional 
active agents. Thus, the invention further includes methods 
for preparing a pharmaceutical composition by formulating a 
pharmaceutically acceptable carrier with an agent which 
modulates expression or activity of a polypeptide or nucleic 

3 0 acid of the invention and one or more additional active 

compounds . 

A pharmaceutical composition of the invention is 
formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
35 include parenteral, e.gr., intravenous, intradermal, 
subcutaneous, oral (e.gr., inhalation), transdermal 



(topical), transmucosal, and rectal administration. 
Solutions or suspensions used for parenteral, intradermal, 
or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, 
5 saline solution, fixed oils, polyethylene glycols, 

glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl 
parabens; antioxidants such as ascorbic acid or sodium 
bisulfite; chelating agents such as 

10 ethylenediaminetetraacetic acid; buffers such as acetates, 
citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be 
adjusted with acids or bases, such as hydrochloric acid or 
sodium hydroxide. The parenteral preparation can be 

15 enclosed in ampoules, disposable syringes or multiple dose 
vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use 
include sterile aqueous solutions (where water soluble) or 
dispersions and sterile powders for the extemporaneous 

20 preparation of sterile injectable solutions or dispersions. 
For intravenous administration, suitable carriers include 
physiological saline, bacteriostatic water, Cremophor EL™ 
(BASF; Parsippany, NJ) or phosphate buffered saline (PBS) . 
In all cases, the composition must be sterile and should be 

25 fluid to the extent that easy syringability exists. It must 
be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier can 
be a solvent or dispersion medium containing, for example, 

3 0 water, ethanol , polyol (for example, glycerol, propylene 

glycol, and liquid polyetheylene glycol, and the like), and 
suitable mixtures thereof . The proper fluidity can be 
maintained, for example, by the use of a coating such as 
lecithin, by the maintenance of the required particle size 

35 in the case of dispersion and by the use of surfactants. 

Prevention of the action of microorganisms can be achieved 



by various antibacterial and antifungal agents, for example, 
parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, 
and the like. In many cases, it will be preferable to 
include isotonic agents, for example, sugars, polyalcohols 
5 such as mannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the 
composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

10 Sterile injectable solutions can be prepared by 

incorporating the active compound {e.g., a polypeptide or 
antibody) in the required amount in an appropriate solvent 
with one or a combination of ingredients enumerated above, 
as required, followed by filtered sterilization. Generally, 

15 dispersions are prepared by incorporating the active 

compound into a sterile vehicle which contains a basic 
dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders for 
the preparation of sterile injectable solutions, the 

2 0 preferred methods of preparation are vacuum drying and 

f reeze-drying which yields a powder of the active ingredient 
plus any additional desired ingredient from a previously 
sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or 
25 an edible carrier. They can be enclosed in gelatin capsules 
or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be 
incorporated with excipients and used in the form of 
tablets, troches, or capsules. Oral compositions can also 

3 0 be prepared using a fluid carrier for use as a mouthwash, 

wherein the compound in the fluid carrier is applied orally 
and swished and expectorated or swallowed. 

Pharmaceutically compatible binding agents, and/or 
adjuvant materials can be included as part of the 
35 composition. The tablets, pills, capsules, troches and the 
like can contain any of the following ingredients, or 



compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating agent 
such as alginic acid, Primogel, or corn starch; a lubricant 
5 such as magnesium stearate or Sterotes; a glidant such as 
colloidal silicon dioxide; a sweetening agent such as 
sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are 

10 delivered in the form of an aerosol spray from a pressurized 
container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or 
transdermal means. For transmucosal or transdermal 

15 administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are 
generally known in the art, and include, for example, for 
transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can 

2 0 be accomplished through the use of nasal sprays or 

suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or 
creams as generally known in the art. 

The compounds can also be prepared in the form of 
25 suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared 
with carriers that will protect the compound against rapid 

3 0 elimination from the body, such as a controlled release 

formulation, including implants and microencapsulated 
delivery systems. Biodegradable, biocompatible polymers can 
be used, such as ethylene vinyl acetate, polyanhydrides , 
polyglycolic acid, collagen, polyorthoesters , and polylactic 
35 acid. Methods for preparation of such formulations will be 
apparent to those skilled in the art. The materials can 



also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
5 pharmaceutically acceptable carriers. These can be prepared 
according to methods known to those skilled in the art, for 
example, as described in U.S. Patent No. 4,522,811. 

It is especially advantageous to formulate oral or 
parenteral compositions in dosage unit form for ease of 

10 administration and uniformity of dosage. Dosage unit form 

as used herein refers to physically discrete units suited as 
unitary dosages for the subject to be treated; each unit 
containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in 

15 association with the required pharmaceutical carrier. The 

specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique 
characteristics of the active compound and the particular 
therapeutic effect to be achieved, and the limitations 

2 0 inherent in the art of compounding such an active compound 
for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 
100 mg/kg of body weight (generally 10 mg/kg to 20 mg/kg) . 
If the antibody is to act in the brain, a dosage of 50 mg/kg 

25 to 100 mg/kg is usually appropriate. Generally, partially 
human antibodies and fully human antibodies have a longer 
half -life within the human body than other antibodies. 
Accordingly, lower dosages and less frequent administration 
is often possible. Modifications such as lipidation can be 

30 used to stabilize antibodies and to enhance uptake and 

tissue penetration (e.gr., into the brain). A method for 
lipidation of antibodies is described by Cruikshank et al . 
( (1997) J". Acquired Immune Deficiency Syndromes and Human 
Retrovirology 14:193) . 

35 As defined herein, a therapeutically effective amount 

of a SERM, protein or polypeptide (i.e., an effective 



dosage) ranges from about 0.001 to 30 mg/kg body weight, 
preferably about 0.01 to 25 mg/kg body weight, more 
preferably about 0.1 to 20 mg/kg body weight, and even more 
preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 
5 4 to 7 mg/kg, or 5 to 6 mg/kg body weight. 

The skilled artisan will appreciate that certain 
factors may influence the dosage required to effectively 
treat a subject, including but not limited to the severity 
of the disease or disorder, previous treatments, the general 

10 health and/or age of the subject, and other diseases 
present. Moreover, treatment of a subject with a 
therapeutically effective amount of a protein, polypeptide, 
or antibody can include a single treatment or, preferably, 
can include a series of treatments. In a preferred example, 

15 a subject is treated with a SERM, antibody, protein, or 

polypeptide in the range of between about 0.1 to 2 0 mg/kg 
body weight, one time per week for between about 1 to 10 
weeks, preferably between 2 to 8 weeks, more preferably 
between about 3 to 7 weeks, and even more preferably for 

20 about 4, 5, or 6 weeks. It will also be appreciated that 
the effective dosage of antibody, protein, or polypeptide 
used for treatment may increase or decrease over the course . 
of a particular treatment- Changes in dosage may result and 
become apparent from the results of diagnostic assays as 

25 described herein. 

The present invention encompasses agents which modulate 
expression or activity. An agent may, for example, be a 
small molecule. For example, such small molecules include, 
but are not limited to, peptides, pept idomimet ics , amino 

3 0 acids, amino acid analogs, polynucleotides, polynucleotide 
analogs, nucleotides, nucleotide analogs, organic or 
inorganic compounds (i.e, . including heteroorganic and 
organometallic compounds) having a molecular weight less 
than about 10,000 grams per mole, organic or inorganic 

35 compounds having a molecular weight less than about 5,000 
grams per mole, organic or inorganic compounds having a 



molecular weight less than about 1,000 grams per mole, 
organic or inorganic compounds having a molecular weight 
less than about 500 grams per mole, and salts, esters, and 
other pharmaceutically acceptable forms of such compounds. 
5 It is understood that appropriate doses of small 

molecule agents depends upon a number of factors within the 
ken of the ordinarily skilled physician, veterinarian, or 
researcher. The dose(s) of the small molecule will vary, 
for example, depending upon the identity, size, and 

10 condition of the subject or sample being treated, further 
depending upon the route by which the composition is to be 
administered, if applicable, and the effect which the 
practitioner desires the small molecule to have upon the 
nucleic acid or polypeptide of the invention. Exemplary 

15 doses include milligram or microgram amounts of the small 
molecule per kilogram of subject or sample weight (e.g., 
about 1 microgram per kilogram to about 500 milligrams per 
kilogram, about 100 micrograms per kilogram to about 5 
milligrams per kilogram, or about 1 microgram per kilogram 

20 to about 50 micrograms per kilogram. It is furthermore 

understood that appropriate doses of a small molecule depend 
upon the potency of the small molecule with respect to the 
expression or activity to be modulated. Such appropriate 
doses may be determined using the assays described herein. 

2 5 When one or more of these small molecules is to be 

administered to an animal (e.gr., a human) in order to 
modulate expression or activity of a polypeptide or nucleic 
acid of the invention, a physician, veterinarian, or 
researcher may, for example, prescribe a relatively low dose 
30 at first, subsequently increasing the dose until an 

appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular 
animal subject will depend upon a variety of factors 
including the activity of the specific compound employed, 

3 5 the age, body weight, general health, gender, and diet of 

the subject, the time of administration, the route of 



administration, the rate of excretion, any drug combination, 
and the degree of expression or activity to be modulated. 

The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors. 
5 Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (U.S. 
Patent 5,328,470) or by stereotactic injection (see, e.gr., 
Chen et al . (1994) Proc . Natl. Acad. Sci . USA 91:3054-3057). 
The pharmaceutical preparation of the gene therapy vector 

10 can include the gene therapy vector in an acceptable 

diluent, or can comprise a slow release matrix in which the 
gene delivery vehicle is imbedded. Alternatively, where the 
complete gene delivery vector can be produced intact from 
recombinant cells, e.g., retroviral vectors, the 

15 pharmaceutical preparation can include one or more cells 
which produce the gene delivery system. 

The pharmaceutical compositions can be included in 
a container, pack, or dispenser together with instructions 
for' administration. 

20 

METHODS OF TREATMENT 

The present invention provides for both 
prophylactic and therapeutic methods of treating a subject 
at risk of (or susceptible to) a disorder or having a 

25 disorder associated with aberrant expression or activity of 
a polypeptide of the invention. The nucleic acids, 
polypeptides, and modulators thereof of the invention can be 
used to treat cancer (e.g., breast or endometrial), 
osteoporosis and cardiovascular disease (e.g., myocardial 

30 infarction, atherosclerosis), restenosis, Alzheimer's 

disease, urinary incontinence, hair loss and decreased or 
absent libido, as well as other disorders described herein. 
SERMs and other therapeutic agents may be administered to a 
patient in an amount sufficient to exert a therapeutic 

35 effect. Such an effect may prevent onset of a disease, or 
may delay disease onset or reduce disease severity or 



duration. Conditions that may benefit from such therapies 
(prophylactically or after onset of the disease) include 
cancer (e.g., breast or endometrial), osteoporosis and 
cardiovascular disease (e.gr., myocardial infarction, 
5 atherosclerosis), restenosis, Alzheimer's disease, urinary 
incontinence, hair loss and decreased or absent libido.' 



Prophylactic Methods 

10 In one aspect, the invention provides a method for 

preventing in a subject, a disease or condition associated 
with an aberrant expression or activity of a polypeptide of 
the invention, by administering to the subject an agent 
which modulates expression or at least one activity of the 

15 polypeptide. Subjects at risk for a disease which is caused 
or contributed to by aberrant expression or activity of a 
polypeptide of the invention can be identified by, for 
example, any or a combination of diagnostic or prognostic 
assays as described herein. Administration of a 

20 prophylactic agent can occur prior to the manifestation of 
symptoms characteristic of the aberrancy, such that a 
disease or disorder is prevented or, alternatively, delayed 
in its progression. Depending on the type of aberrancy, for 
example, an agonist or antagonist agent can be used for 

25 treating the subject. The appropriate agent can be 

determined based on screening assays described herein. 

Therapeutic Methods 

Another aspect of the invention pertains to methods of 
3 0 modulating expression or activity of a polypeptide of the 
invention for therapeutic purposes. The modulatory method 
of the invention involves contacting a cell with an agent 
that modulates one or more of the activities of the 
polypeptide. An agent that modulates activity can be an 
35 agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate ligand of the 



polypeptide, a peptide, a peptidomimetic , or other small 
molecule. In one embodiment, the agent stimulates one or 
more of the biological activities of the polypeptide. 
Examples of such stimulatory agents include the active 
5 polypeptide of the invention and a nucleic acid molecule 
encoding the polypeptide of the invention that has been 
introduced into the cell. In another embodiment, the agent 
inhibits one or more of the biological activities of the 
polypeptide of the invention. Examples of such inhibitory 

10 agents include antisense nucleic acid molecules and 

antibodies. These modulatory methods can be performed in 
vitro (e.g., by culturing the cell with the agent) or, 
alternatively, in vivo (e.gr., by administering the agent to 
a subject) . As such, the present invention provides methods 

15 of treating an individual afflicted with a disease or 

disorder characterized by aberrant expression or activity of 
a polypeptide of the invention. In one embodiment, the 
method involves administering an agent (e.gr., an agent 
identified by a screening assay described herein) , or 

20 combination of agents that modulates (e.g., upregulates or 
downregulates) expression or activity. In another 
embodiment, the method involves administering a polypeptide 
of the invention or a nucleic acid molecule of the invention 
as therapy to compensate for reduced or aberrant expression 

25 or activity of the polypeptide. 

Stimulation of activity is desirable in situations in 
which activity or expression is abnormally low or 
downregulated and/or in which increased activity is likely 
to have a beneficial effect. Conversely, inhibition of 

30 activity is desirable in situations in which activity or 

expression is abnormally high or upregulated and/or in which 
decreased activity is likely to have a beneficial effect. 

This invention is further illustrated by the following 
examples which should not be construed as limiting. The 

35 contents of all references, patents and published patent 



applications cited throughout this application are hereby 
incorporated by reference. 



EXAMPLES 
Example 1 

Preparation of Estrogen -Receptor Enhanced Vascular 
5 Endothelial Cells 

Estrogen receptor a (ERa) -enhanced vascular endothelial 
cells were prepared by infecting primary vascular 
endothelial cells with a retrovirus expressing human 
papillomavirus E6/E7, which leads to cellular 

10 immortalization. The retrovirus expression vector 

LXSN16E6E7 contains the immortalizing oncogenes E6 and E7 
from human papilloma virus type 16 and a neomycin resistance 
gene. (Halbert, C.L., Demers, F.W., and Galloway, D. A. 
1991. J Virol. 65: 473-478 (1)). This vector was 

15 transfected into PA317 cells, an amphotrophic packaging cell 
line, by calcium phosphate precipitation. Stably clonal 
lines were established by G418 selection. A negative 
control line expressing the empty retrovirus LXSN was 
established. Viruses produced by the clonal lines of PA317 

2 0 were used to infect second passage primary human aortic 

endothelial cells obtained from Cell Applications, Inc., San 
Diego, CA. Following infection and selection with G418, the 
LXSN infected cells ceased growth after approximately 7 
passages where the LXSN16E6E7 infected cells were growth 

2 5 extended beyond 2 0 passages. 

These immortalized cells showed very low expression of 
ERP mRNA and no detectable ERa mRNA, and no estrogen 
response could be detected. The cells expressed the 
expected endothelial markers. The levels of ERa were 

30 increased by infection of the cells with a retrovirus that 

expresses recombinant ERa (Green, S. et al , Nature 1986 32 0: 
134-137) . Retroviral vectors were constructed to express 
estrogen receptor a (LXSZERa) or p (LXSZERP) and confer 
zeocin resistance. A clonal line of the immortalized 

35 vascular endothelial cells at passage 8 post-immortalization 



was infected with virus produced by PA317 cell transfected 
with these ER expressing retroviruses. Following zeocin 
treatment and clonal selection, one immortalized vascular 
endothelial cell line expressing ERct and one expressing ER(5 
5 were established. Estrogen receptor expression was verified 
by RNase protection, Western blotting, and immunostaining . 

Enhancement of the level of ERa protein in these cells 
was found in greater than 90% of the cells by 

immunocytochemistry . The expression of vascular endothelial 
10 growth factor (VEGF) mRNA was increased three fold by 

treating the cell line (VE-ERa) with estrogen (shown in 
Figure 1 by, RNAse protection analysis) . This demonstrates 
the estrogen responsiveness of these cells. 

15 

Example 2 

Identification of Estrogen-Regulated Markers 

This Example illustrates the use of VE-ERa cells to 
screen for estrogen-regulated markers. 

2 0 The Incyte UniGEM technology was used to survey a large 

panel of genes and to establish the estrogen regulated gene 
profile of the VE-ERa cells. VE-ERa cells were treated for 
6 hours with 17-P estradiol or vehicle control (ethanol) . 
Poly (A) RNA was isolated and this RNA was used to 
. 25 synthesize Cy3 or Cy5 labeled cDNA for simultaneous probing 
of Incyte UniGem V. Reciprocal labeling was done and two 
UniGems were probed (for one GEM the estradiol probe was 
labeled with Cy3 and ethanol probe with Cy5 and this 
labeling was reversed for the second GEM) . UniGem V 

3 0 contains approximately 850 0 human sequences including both 

known genes and ESTs . More than 20 genes were identified 
which were regulated more than 2 fold on both UniGEMs (Table 
I, indicated as UniGem. The sequence indicated in Table I 
with RF were identified by RNA fingerprinting, a more 
35 stringent version of differential display. RNA 



fingerprinting is based upon the use of arbitrary primers 
and primer combinations to carry out PCR amplifications from 
cDNA populations derived from cells or tissues of interest. 
The process generates a series of radioactively-labeled 
5 fragments which can be resolved by PAGE and visualized by 
autoradiography- (Mathieu-Daude , et al . Methods Enzymol 
1999/303:309; Stone, et al . , Nucleic Acids Res 1994 Jul 
11;22 (13) :2612-8. ) 

Side by side comparisons of such gel displays generated 

10 from distinct cell states or tissues types allow the 

investigator to identify differentially gene expression in 
the target cells or tissues. RNA was isolated from VE103ERa 
cells (control Vs treated for 24 hours with 100 nM 17-p- 
estradiol) . For initial confirmation studies of genes 

15 indent ified by RNA fingerprinting, a northern blot 

containing poly (A) RNA (vehicle control versus 100 nM 17 
estradiol) is probed with each gene. This step is not 
necessary with genes identified by the UniGem where an 
estrogen treatment time course was performed. RNA was 

2 0 isolated and RNase protection analysis was used to determine 
estrogen responses (Figure 2) . All 2 0 of the genes examined 
were confirmed to be regulated to a higher fold modulation 
at 24 hours than the 6 hours used for the UniGem experiment . 
Table I provides a summary of estrogen-regulated markers 

25 identified in this study. 
Table I 

Representative Estrogen-Regulated Markers 



ERM 


Sequence or Database Name 


Acc . No . 


Detection 


1 


Tumor necrosis receptor 2 (75kD) 


M32315 


UniGem 


2 


Selectin P (granule membrane 
protein 140) 


M25322 


UniGem 


3 


Prostaglandin 12 (prostacyclin) 
synthase 

The enzyme required for the last 
step in the synthesis of PGI2, an 
inhibitor of atherogenesis 


D84124 


UniGem&RF 



4 


Latent transforming growth factor 
beta binding protein 


M34057 


UniGem&RF 


5 


N-acetyllactosamine synthase ( - 
galactosyl transferase) 


Dz 9 8 05 


T T-i_r-. -1 /""I * .nj-. r** T^i T^n 

UmGem&RF 


6 


Iduronate 2-sulfatase (Hunter 
syndrome ) 


L13329 


UniGem 


7 


RIG mRNA (Regulated ^En Glioma) 


U32331 


UniGem 


8 


NEF-associated factor 1 alpha 


D30755 


UniGem 


9 


hbc647 


U68494 


UniGem 


10 


Follistatin 


M19481 


UniGem 


11 


Epidermal growth factor receptor 
kinase substrate (eps8) 

A signaling molecule that serves 
as a substrate for receptor and 
nonreceptor tyrosine kinases. 
Eps8 appears to be involved in 
cell proliferation and 
cytoskeleton remodeling. 


U12535 


UniGem 


12 


Cyclin Al 

A cyclin A-like gene, having 48% 
identity to human cyclin A and 84% 
identity to murine cyclin Al (Yang 
et al . , Cancer Res. 57:913-920, 
1997) 


U66838 


UniGem 


13 


Serum inducible kinase mRNA 


AF059617 


UniGem 


14 


Sarc/ endoplasmic reticulum 
CaATPase 


Y15724 


UniGem 


15 


Podocalyxin protein 

A ligand for L-selectin similar in 
structure to CD34 


U97519 


UniGem 


16 


Natriuretic peptide receptor B 
(NPR-B) 

The guanylyl cyclase receptor for 
C-type natriuretic peptide (CNP) , 
and endothelium dependent 
venodilator that has antimitogenic 
activity on smooth muscle cells 


L13436 


UniGem 


17 


Leupaxin 


AF062075 


UniGem 


18 


Ezrin-radixin-moesin binding 
phosphoprot e in 


AF015926 


UniGem 


19 


Hexokinase I 


AF016365 


UniGem 


20 


Connexin 3 7 (Gap junction protein 
alpha 4) 

A gap junction protein in 
endothelial cells 


M96789 


UniGem 



21 


Diastrophic dysplasia sulphate 
transporter 

This gene is mutated in a form of 
osteochondrodysplasia that leads 
to dwarfism and spinal deformation 


U14528 


UniGem 


22 


Cadherin 11 (OB-cadherin) 


D21255 


UniGem 


23 


6 phospho fructo-2 kinase/f ructose 
2,6 bis phosphatase 


D49817 


UniGem 


24 


Ras-related rhoB 

A member of the Rho family of 
GTPases that includes Rac, Rho and 
Cdc42 . Rho B has been implicated 
in cell growth control, actin 
regulation and adhesion dependent 
viability 


X06820 


RF 


25 


Transglutaminase 


M55153 . 1 


RF 


26 


Crk-associated substrate pl30Cas 

A signaling molecule important for 
pathways involved in cell 
invasion, adhesion and survival 


AB040024 


RF 


27 


Clusterin (Apolipoprotein J) 

Clusterin has been found at 
increased levels in 
atherosclerotic lesions and in 
wound r epa i r . 


J02908 


RF 


28 


Connective tissue growth factor 

Mediates cell adhesion and 
migration 


U14750 


RF 


29 


Putative type II membrane protein 


NM014254 


RF 


30 


Legumain 


Y09862 


RF 


31 


Complement component C4A 


K02403 


RF 


32 


WDR1 protein 


AF020056 


RF 


33 


ADP ribosylation factor 


AF052179 


RF 


34 


Casein Kinase II (alpha) 


M55265 


RF 


35 


CPSF (cleavage and polyadenylation 
specificity factor) 


AF171877 


RF 


36 


ATP Sulfurylase/APS Kinase 2 (PAPS 
synthetase 2) 

A bifunctional protein used to 
synthesize the universal sulfate 
donor PAPS ( 3 ' phosphoadenosine 
5 ' phosphosulf ate) . A mutation of 
this gene leads to a 


AF074331 


RF 





cartilage /skeletal disease 
(spondyloepimetaphyseal dysplasia 






3 / 


Human ubiquitin conjugating enzyme 
type UBC9 


T 1A C O O Q 


JA. 17 


3 O 


Small membrane protein 1 (SMP1) 
mRNA 


I\r U o ± o ^ 


Kr 


39 


Casein kinase II (beta) 


M30448 


RF 


40 


Trabeculin-alpha mRNA; actin 
binding protein ABP620 


AF141968 


RF 


41 


Ca+ Dependent Protease (Homo 
sapiens calpain, small polypeptide 
(CAPN4) 


X04106 


RF 


42 


Transforming acidic coiled-coil 
containing protein J tiAL.(_.i; mKJNA 


AF093543 


RF 


43 


Sterile-alpha motif and leucine 
zipper containing kinase (AZK) , 
mixed lineage kinase 


AF251441 


RF 


44 


Putative TH1 protein 


AJ238379 


RF 


>I IT 

4 5 


oUb aciQic riDosomax protein fvj 
mRNA) 


ATT TTOTQ 
J\r 1 / J -5 / O 


rCr 


46 


Tubulin, beta, 2 (TUBB2) mRNA 


NM006088 


RF 


47 


HLA-E gene (major 
histocompatibility complex) 


X64879 


RF 


48 


Myosin X (MYO10; mRNA 




T~> TT» 


49 


VE - cadher in- 5 

A mediator of endothelial cell 
adhesion 


X79981 


RF 


50 


Bromodomain adjacent to zinc 
finger domain, 2A (BAZ2A) 


AB002312 


RF 


51 


Carbohydrate ( chondroi t in6 / 
keratan) 

sulfotransf erase 2 

The key enzyme in the biosynthesis 
of chondroitin 6 -sulfate, a 
glycoaminoglylcan implicated in 
chondrogenesis , neoplasia, 
atherosclerosis and other 
processes . Oversulf ated 
chondroitin sulfate isomers have 
anticoagulant and anti- 
atheroscl erotic activities 
suggesting a pertinent role in the 
disease process . 


AF083066 


RF 


52 


Endothelin-1 


Y00749 


RF 


53 


ABP32 (peflin (PEF) 


AB026628 


RF 


54 


TAD A- 1 protein mRNA 


AF132000 


RF 


55 


CDC45L (CDC45L) mRNA 


AF081535 


RF 



56 


Homo sapiens cosmids 


U82671 


RF 


57 


Homo sapiens full length insert 

CDNA Y073E04 (SEQ ID NO: 1) 

TTTTTTTTTCTTTAAATAACAATTTGACAAAAGG 

GTGAAGAAATCCTAAACAAGGTATTGAGGCCAGT 

GTCCAGGCTGCATTCAGTTCACAGAACTGTCCTC 

AGGACGTTGCATGGAACTGGAAATGTGTATAATT 

ACAGAAGAAAACAGGGAGGACTTAGTGCAGAGAG 

GAGACGAGTGTGGACGGGCAACAGCATCCTTAGT 

CTTTCATATTTATATATGGTATATGTATTTTCTA 

TATATATATTTATATATTTTACATCCAGGTATCC 

CAGTCATCTGTACCATTTCCCAGGGAGACATGGG 

TGCTTCCAAGGCGAGACAGGAAAGGGTTAGGCAG 

GGAAGGGGCAGCGACGGTGCAGGCTGGGGCTTGG 

CTCACAGAAGCTGCAGGAGCTTCAGCAGCTGTAA 

GAGGGCCCCGGGCTCCGCAGACGCCAGGTACTGA 

GNCAAAGCCAGTCCTCCAGCTCCACGCCCCGCCT 

GCGATCCACCGCCTTCTCCGCAAACTTCATCATC 

ATCAGGGCCCGCTTCATGTCGATCCAGTTGTGCA 

GCGTGCGCACA 


AA747315 


UniGem 


58 


EST (SEQ ID NO: 2) 

TNNNTTTNTGTGGTNGTGCGGTGGGGGNGGTAGG 
GACAGTGTACAGAGTTCCAGCTCCTCTTTGATTC 
ACCCCCTTCCCTCCTCCTCCCTTCTTTCCCATTG 
CTACACTAAAGGAGTCAGGGCTGTGTCCACCAAT 
ATGCTCTGTGACTAGCAGTCAACACAATTTCATA 
CCCAGCCAGTCATTCCGGAGNATTTGGGATGATA 
ATGTTCCCATTCTCAGTTCCTCTGTGAGAATGTG 
NATGTCATTTGAAGCTCTGTGCCTCAGTTTCCAT 
AGAAGTAAAACAAAAAGCAAGAATTGAGACTGAT 
CTTTTNCTTGNAAGAATTCACANGTAATGGATAA 
TGTACAAGAGATGTGGCTGAATAAGAGAACTAGA 
ACAANGACTTGAACAATTTNCCTATGCAACACTC 
TTTCCCATGGACAAAGGTATCCCACAAAGTCATN 
CTTCATCGGAGGCTAGTTTGTCCAAAGAAAACTC 
ACATACCAATGCAATAGTGACAAAAAAAAATTCC 
TCCATGCCAATTCAGGAAACAGAACGCAAGTTGT 
GCTTAAGNCCATGGTNGGGTGTGGGAAGATGTTC 
CAANACCGNCACCCCTTACCCTCNTCTATGACTA 
TCTNCAACCCCCNGNANNGGGGCTCGNCAGTTTG 
NTAAAGGGGTTAATTACCANNAGGGGGCNTGTCC 
CCCCAGNGGTCNCTTGGGGNGTGANANNCNANAA 
AACCNCTNTCCATTTACCAAACAAANACACNTCC 
CTGGGATTATTNGGGGGTTNCAGGCTTTTNAGNC 
ANNTTCCANGCCNTNGNTCTGNCCCTNNTCCNNG 
GTTGGGAANCCATCCCCCCNGCTCCTGATTACCA 
ATNCCCNAATCCCNNGGTGGGGTTGNATTCATCT 
CTCCCAAAATGTTNGTTGGNTTCGGGGGGTNAAG 
ANTTT 


W28965 


UniGem 


59 


EST (SEQ ID NO: 3) 

CACAGGGTCACAGATTCACCAAAGCTGAAAGGGC 
TGAGGAGCTCATGGTAGCCTGGGTTGACCTACTC 
TGGAGCACGGTGTCTTCCTTCTAAACTGAGTGAC 


AA194161 


UniGem 





TGTAGTACTATCTGTGCCTCTGATGGTAATAAAA 
CTGACAAGATGTCTAATTTTTTTTTAAGTAGGAC 
CAAAGGAAAACAAGATTTAGATAGTCTGACTTTG 
CTTTTGAACAACAGACATTGCAAGTCAAAATTGT 
TGTCAAATTTACATATGGTAAATGATGAACTTTA 
AAAATGTGTCCAGGTGTTAGATGAGTTCATTAGA 
CTCTTTTAATGCTAATGGCTAGTACCGTTTAAAC 
AAAACAGCAGTTCTCTGCTGCAATATTCCCATTG 
ACCACTTAAATGACCATAAGTGGTCATTTAAGAA 
CATGTTAGGGTTAGCCCTGATCTGAATATAAAAG 
TGAGAAAAGGGCTACAGTGCATTTCTTGGTAACT 
TAAACTGAGTCCTGAAGTTATAATGATCCATTCG 
AGTTCNGTGATCCTTATTGGTCCTAATGGTGGTT 
CCCNACCGTATTGGTACAGATGAGCCATACGTTC 
CCTTGGTACCATGTAGACATGACTTCAGATACCC 
CTGAGGACCTNCCAGCA 






.60 


P311 mRNA (SEQ ID NO: 4) 

TCGACGCTNAGAGTGGACAGGGAGCGGGGCTTTT 

GTCTGTTGGTCTCCCTGGACTGAAGAGAGGGAGA 

ATAGAAGCCCAAGACTAAGATTCTCAAAATGGTT 

TATTACCCAGAACTCTTTGTCTGGGTCAGTCAAG 

AACCGTTTCCAAACAAGGACATGGAGGGAAGGCT 

TCCTAAGGGAAGACTTCCTGTCCCAAAGGAAGTG 

AGCCGCAAGAAGAACGATNAGACAAACGCTGCCT 

CCCTGACTCCACTGGGCAGCAGTGAACTCCGCTC 

CCCAAGAATCAGTTACCTCCACTTTTTTTAATCG 

TAACACCTCCATTTGTATTACATATGGTGTATGG 

GTATTGATGAGGTCATGGTATCATATATGGGATT 

TTTTTCTGTGTAAATCATCAAGTATAAGAAGAAA 

CTATGGGACTCTGAGCCTTGCTTTAGAGAANTTA 

CAGTGG 


U30521 


RF 


61 


Homo sapiens clone 24 992 mRNA 

sequence (SEQ ID NO: 5) 

CCGTCACTAKGCTGACCGCTGCAACTACCAGACT 

TTCTGTGCTGATGGAGCAGATGAGAGACGCTGTC 

GGCATTGCCAGCCTGGSAWTTTCCGATGCCGGGA 

CRAGAAGTGCGTGTATGAGACRTGGGTGTGCGAW 

GGGCACCCAAACTGTGCGGACGGCARTGATSAGT 

GGGACTGCTCCTATGTTCTGCCCCGCAAGGTCAT 

TACAGCTGCAGTCATTGGCAGCCTASTGTGCGGM 

CTGCTCCTGKTCATCGCCCTGGGCTGCACCTGCW 

WGCTCTATGCCATTCGCACCCAGGAGTACAGCAT 

CTTTGCCCCCCTCTCCCGGATGGAGGCTGAGATT 

GTGCATCAACAGGCACCCCCTTCCTACGGGCAGC 

TCATTGCCCAAGGTGCCATCCCACCTGTAGAATA 

CTTTCCTACAGAGAATCCTAATGATAACTCAGTG 

CTGGGCAACCTGCGTTCT 


AF131760 


RF 


62 


Human DNA sequence from clone RP5- 
863C7 on chromosome 20pl2.3-13. 
(AL049761) (SEQ ID NO: 6) 
ATCACAATCCAGTGCAAACATAATCACAAATTGC 
ATCTCTGGCACATCTGGTGCTTTTAGCTTCTGCA 
CAAATTCAACATGGTAACCCTCACAGCATTCTAG 
GGCATAAAAGGGTCAAGTACAATAAATATCCACY 


AL049761 


RF 





GTAAGTGGTCATAAGGAAGAAAAT C ACC CTGC CA 
CACAGTCAGAATGTTTTCCCTTCAGCTCKCCCTA 
CCGTTGAGAAGCGCTATAAAAGGAGGCTTTAKCS 
TCCTCTAAAGTTTACCATTATATTCACTAAAAGC 
CACCACCTTGATAAAGTTACTAAAGCCAAGATGG 
GTTACAAAGTTAGATAAGTTCATTAAATTCAACT 
CCCCAAACAATTACGTTTATCTATGATGCCTAGG 
CGAGAAAGGCCTGTGAATCTATAASGTAGGAGAA 
TGGGGAACTAAGATTTTAGGCCTTAAAATATGTT 
GGAA 
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Novel Sequence (SEQ ID NO: 7) 

CCGTCACTAGGCTGCCAGAAGATCCTCCAAGGAG 

CCATCCCAGACCCAGTGATCTGCTGCCACTCACA 

TTGTGGCAGTGACACTTCCAAAGCAGCTATTGTA 

TTCTGGAGCCTCCTGGATGAACATCAGCTTTATC 

ACCTGGAGCCTGTTACAAACGTAGAATCTTGACC 

TCAGCCCAGCCCTACTGAATCAGAGTCTACATTT 

TTAATGAGATCCCCANGTGATTCGCAAGCACATG 

AGAGTTTGAGAAGTGCTGTCTGGTTACTTATGGC 

AGCCTAATGACGGTAATCGAATT 




RF 


64 


Homology to TADA1 protein mRNA 
(SEQ ID NO: 8) 

TGCTGCACNAGTCAATCACANTTTGCAACTGCAA 
AAGAGGCATAAACTGACGGGGATCTCCTGCTTTA 
CTATGCACACACAGCTTAACTCTCCAACCGTGGA 
GAAC AAC TAAC TC T AT ANGAAAAATNAAAAGNT A 
AACTNAACCATCCAAGGATGGGGGNAAGGGCTTC 
ATGGGTAACTTCTGGCCTGTNAGCACTAAATTNC 
AAACTGCTCCTGCAAGAAACCTNAAGGGCTGGGA 
AGGCTTTATCTGGCTCAAAAACTAAGGNTGCTCT 
CNAAAGAATTCAAATTCTCCTCAACAANCNANAA 
ANANTKGTCAATCCAATTCAAGGCTCCCNTNANA 
TTGNAAGNANTTCCTNTNAACAACAACAAGGTGA 
CTKACTYCTKCAACAATYTAAATTCCCCCGGCSN 
C C ATGGGGGGCNGGAAANAT S S AANTNTNGGC CC 
AATTTCCCCANAATKAGTCTTATAAAAATCACWG 
GGCGTCTTTAAANGTCTKACTGGGNAAAAAC 




RF 


65 


Homology to PEX mRNA for 
perixisome biogenesis factor 
(peroxin) (SEQ ID NO: 9) 
CCATTCGTCCGTAGGTCCAGGTGTTTTACGTCAG 
CAGGGAAATGTGGCACACGCCCTCGAGGCATTTT 
AACACTGCGCTTCAGGAAATCTCAAGTTCCATCT 
TGTGTTAGTAACGTACCCACATTTTGCTGGAGTT 
AGTTTATTAAAGATGCCTACGGTGAACTCTCTGG 
CGCAGGTTAAATGCAGTTTTGAAAACCTGGAAAC 
ATCAAATGGAGGCGGGAAATAAGCTGGGGCCGAG 
CTGAGGGGCTGAACACGA 


AE003829 


RF 


66 


Homology to human DNA sequence 
from clone RP5-863C7 on chromosome 
20pl2 (SEQ ID NO: 10) 

TTGTGCTTAAAAAAATAGGCTTCCTCAGTGAAGC 
ACCTGATAAAMTTAGGTGGTTGGATTACAGTGGA 
AAGTCCACTTTACACACACACACAAACACACAAA 


AL049761 


RF 





AACTGGCGACTTTTTCCATCCCCAATCCCTGCAM 
TGSTGAGACACAAAATGAGTTTTWTGGCAAAGGA 
TCCTTAATCCCCAGAGACGCTTTGGCTTGTGGTG 
CTTTTTTTTTTAAGGCCTCTCTGCTCTGCCCGGT 
ACCATGGGTCGAACGAGGGGTGTATAAAATGGGG 
GCCTTGGGAAGCCTCCACGGTACAGGGCTGCAGG 
CCCCTCAGATGTGAACACGACGACCTGGGA 






67 


Human cDNA FLJ2 0400, clone 

KAT00587 (SEQ ID NO: 11) 

TGTCAGGGATACCGAGAGCCGGTGTTCTGTTAAC 

CTCTGGGAATGGTCCTCCCAATCCTAGGCCAGTT 

GGCCTGGGCCCAGGACCAAACCCCAATCTGAGAT 

CAGGCTTTTTAGGGACAAACCCTGCCCCCAGGTC 

AGGTGTGTTTCCAGGCCCAGGCCTTGGGCCCAAC 

CCAAGACCAAGTGGCCTGGGCCCAGGCCCTAATC 

TANATRCCAGAGCAGGTGGCCTCTTGGGCACAGG 

ATCTGGTCTTAACTTAAGAATGGCTGGACCTCAA 

GGCCTCGATCTTGCCCCCATTCTAAGAGCAGCAG 

GTCTTTTAGGAGCAAATTCAGCTTCTTTCTCACA 

GGCTTCTGGAAACATGGGCACAAGCCCATCCTCC 

ATGGCAAGAGTACCTGGCCCCATGGGCCCAAACT 

CGGGTCCTANCTCTCGGTATCCCTGACAATCGAA 


AK000407 


RF 
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EST55810 from infant adrenal 

gland-2 (SEQ ID NO: 12) 

TCGGATCCACAGAGACCTGCATGGGAGGTGGGGC 

CACAGGTCTGGTATCAGGCAAACCTAGGTTGGAA 

CACTGGCTCCATAAAGAGGAAGTCACTTAACCTT 

CTCTGGGGCATGGTTTCTTCATCTGTTCCCACCT 

C TGAAGAC TAT C GT AAGACAGAATGAAAGTTAAG 

CAACTTAACGCAACGCCCAGGATACCAGAATTAT 

TCTAAATGGCAGAATCCTACTTAGTCTGTCATCT 

TGGGAGTTCTCTAGGCAGGCAGGTTGCCAGGGGT 

GGGGCTGAGATCCAGATGTGCTCTCGGTATCCCT 

GACA 


AA349232 


RF 


69 


Hu. mitochondrial DNA, frag. Ml 

J01415 (SEQ ID NO: 13) 

TGACACTAGTCACGTGGCAGGGGGTTTTATATTG 

ATAATTGTTGTGATGAAATTGATGGCCCCTAAGA 

TANAGGANAC AC C TG C TAGGTGTAAGGAGAANAT 

GGTTAGGTCTACGGAGGCTCCAGGGTGGGAGTAN 

TTCCCTGCTAAGGGAGGGTAAACTGTTCAACCTG 

TTCCTGCTCCGGCCTCMACTATACCAAWTGCNAC 

CAGGATTAGGAAAAAGGGGGGTAANATCAAAACC 

TTATNTTNTTTATGCGGGGAAACNCCATATCGGG 

GGCACCKATTATTAGGGGAACTATTCATTTCCAA 

ANCCNCCGATTATGATGGGTATTACTATGAANAA 

NATTTTACAAATGCATGGGCTGTGACAANAACNT 

TRTAAATTTGSNCTTACCNAAAANGTTNCCTGGG 

YTGGCCCANCTCGGCTCNAAAAANGAGKCTAAAA 

CTTTTCCTAAGACNCCACCTCATGCCCCNAAAAA 

AAAGTANAGTNTTCCAATTTTTTTGTGGNTTGKA 

AAAAAAAATCAACGGTCGGSAAAAWY 


J01415 


RF 


70 


Hu. satellite rep. DNA; alphoid 
(X14302) (SEQ ID NO: 14) 


X14302 


RF 





TCGACGCTNAGAGTGGACATTTGGAGCGCTTTCA 
GGACGACGGTGAAAATGGAAATATCTTCCAAGAA 
AATCTAGATAGAAGCAATGTCAGAAACTTTTATG 
TGATGGATCTACTCAGCTAACAGAGTTGAACCTT 
TCTTTTGAGAGAGCAGTTTTGCAACACTCTTTTT 
GTGGAATATGCAAGTGGATATTANGGCAGCTTTG 
AGGATTTCGTTGGAAACGGGAATACATGTAAAAA 
GCAGACAGCAGCATTCTCAGAAACTTCTTTGTGA 
TGTTTGCATTGAAGTCACANAGTTNAACATTCCC 
TTTGAGAGAGCAAGTTTGAAACACGCCTTTTGTC 
ATATCTGGAAGTGTCCACTCTCANCGTCGAATCG 
AANTCCCGCNGGCGCCATGGNGGGCGGGAGCATG 
CNACTCNGGCCCAAATCGCCCTANANTGAGTCGT 
AATANAAATCACTGGCCGTCGTTTTACAACGTCT 
GACTGGGGAAAAACCCCC 
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Homo sapiens cDNA FLJ10 870 fis, 

clone NT2RP4001679 (SEQ ID NO: 15) 

ATGNATACGACTCCTATAGGGCGAATTGGGCCCG 

ACGTCGCATGCTCCCGGCCGCCATGGCGGCCGCG 

GGAATTCGATTACCGTCACTAGGCTGCCAAGTAG 

CTGGGACTACAGGCGCATGCCAGCATGCCTGGCT 

AATTTTTGTATTTTTAGTAGAGCGGGGGTTTTGC 

CATGTTGGCCAGGCTGGTCTTGAACTCCTGACCT 

CAGGTGATCCGCCTGCCCTGGCCTCCCAAAGTGC 

TGGGATTACAGGCATGAGCCACTGCGCCCAGCCT 

TGTAATTTACCTTTGATGGTGACCTGAAATGCAG 

TATGATCCTATATGTGGCACGTAGTAATGTTCAG 

TGTTTGCTGAATTAATTCTGGGCAGCCTAGTGAC 

GGTAATCACTAGTGAATTCGCGGGCCGCCTGCAG 

GTCGACCATATGGGAGAGCTCCCAACGCCGTTGG 

ATGCATAGCTTGAGTATTCTATAGTGTCAGCTAA 

ATAGCTTGGGCCGTAATCATGGTCATAGCTT 


AK001732 


RF 
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QV4-BT04 0 7 homo sapiens DNA (SEQ 
ID NO: 16) 

AGTTAGCCGCAGCGAAGAGGACCCAGCAGGAATC 
ATGAGGGAAGGAAAATGCAGCACTCTAAATGGCC 
ACTCAGGCGTTCCTATTCACTCGGAAAATTAGGT 
TCATTTCACAGGACACAGCAGTGTAGATCAGGCT 
TCAACTTAACATTTAAGGGAAATGTCAGATTTTT 
TTTTAATTTAATGAAATTGTTAATGAGGAAAAAT 
TTTTAATATAGTCTTATCTACCACACATCCCCAT 
AGATTTAAGGATTTTAATAGAAAGTCATGATGTA 
TGTATTCAAGCCACGTTAAAAGAAAAAATATAAC 
TATGGACCGGTATTCAGTGAATACAGTTTCATGG 
TTTTTAATTCTTTCAAAGGCACATTAAAAATGGT 
GTGCTGATAAACCCCAAGTAAATTAACCCTTTTT 
CCGTATAAATCCATTTTTTGTTTTGAAGAGGGGA 
AATTATATTTATTGGTTGTTTACTGGAATCCTGG 
TGTGGAAAGCATATCAGATATGTATGAACTGCTA 
CTGGCTGTACTTTCCGATTTACGGGACATCAATT 
TTATTGGCTTATTTGTNANGANCGTGATTAACCA 
TGGACCATTGAAGTACCTATTTTAATGGTNGGGC 
CCTTCAANTGGGGATTGGGNCCAGTGGCCCTTCA 
AGGTCTTCTTGGGGGATTCCGAATNCACTTNAGT 


BE070362 


RF 





GGAATTTCNCNGGNCCGCCTTGCCAAGGTCGANC 
CATNTTNGGGAAAAGCTTCCAAACCGCCGTTTGG 
ATTC 
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Homology to Soares ovary tumor 

NbHOT Homo sapiens cDNA clone 

IMAGE:756918, 5'. (SEQ ID NO: 17) 

CCTTTNTAAACCGACGGCCATTGAATTGTAATAC 

GACTCACTATAGGGCGAATTGGGCCCGACNTCTC 

ATGCTCCCGGCCGCCATGGCGGCCGCGGGAATTC 

NATTGTTACAGCAACCGCCAGGGGGCCTTGGCCA 

GGTCAAGGGTTCTGTGAGGAAAGGACCCANGATT 

GTGGGGGCATTTGGGGGGTGAATTGGCCCCCNAA 

AAATGGAACCCACACCCATAGCTCTCCCCANANC 

TNATACGGCATCCTGCNAAAANACCTGCCCTCCT 

CACTGGGATCCCCTTCCTGCCTCCTCCCAGGGCT 

CTGCCANGGCCTTGCTCAATCCCTTCCACCAAAG 

TCATCTNAACTTCCNTTTCCCCAGGGCCTCCANC 

TGCCCTCAAACACTGATTTNTGTCCCCAGGTGCT 

CTCTGCCCCTCANGCCCCTCTCANCGGCCCANTG 

CCCCGACTCTCCAGGCTTTATCAAGGTGCTAAGG 

CCCGGGTGGGCANCTCCTCGTCTCAAAGCCCTCC 

TCCGGCCTGGTGCTGCCTTTACAAACACCTGCAA 

GAAAANGGCACGGAAACCCCAAGCTTTAAAGCCC 

TCAACAAGTCTGGTGGATCCGAANCACTAATGAA 

TTCNCGGCCCCCTGCANGTCNACAATATNGGAAA 

NCTCCCAACCCGTTTGATGCAAACTTGANTTTCT 

ATANTGTCACCTAAATNCTTGGGGTAATCATGGC 

AAAACTTTTCCCTNTTGGGNNATAA 


HS151B14 


RF 
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Homology to EST17 88 9 Aorta 
endothelial cells, TNF alpha- 
treated Homo sapiens, zv23b05.rl 
Soares NhHMPu SI Homo sapiens cDNA 
clone IMAGE: 754449 (SEQ ID NO: 18) 
ACTAGGCKGACCACACCAACTAGGATGWTCACCA 
CAMCCMCCAGGATGACAATGAAS CTGATAACACC 
AAATGCASCCMCAGTKAGCACANTYTCTGACNTG 
GGGCTGGGCARGATGGTCGCATCTTCCCTCAWGC 
TCARGSTCAWGGTGGTTGAATTGGGGGGAACART 
TTGAAATGTGTCTGGGGTACCAATGCTGGACAAA 
TGGCTTGGCCTGTTAACCTCTGAGGAAGGGATNT 
TCCTGGGTTGGAANAAACTGGCT 


AA308048 


RF 
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Homology to EAP3 0 subunit of ELL 
complex (EAP30) , mRNA (SEQ ID NO: 
19) 

GCTGCACGAGTCAGTCACACTTCGATAATTTGGA 
GACCTAGTTCGTAATAGAAGTCCCCCACGCCCAG 
CATCTCAGACCAAAATCCTTTTCCAGCTACAAGA 
CAGAAAAACAAAATCTCGGTTATTGATTAATTTA 
CATCCCTTCTTTTCTTTCAAACCTGCATTAAAAC 
ATCTATCACATGGGAAGTCTTCCCTGCATGGCAT 
GAAGTTCTTCTGCTCCATCTGCTACTCAGGAAAC 
AATCGGACTCACGGTAACTCCTGCTGAAGAACAG 
GGTCAGCCTAGTGACGG 


AF156102 


RF 









The genes identified were relevant genes for vascular 
endothelial cells and in some cases consistent with the 
known effects of estrogen in the cardiovascular system. One 
5 such gene, prostacyclin 12 synthase, was upregulated 4 fold 
by estradiol at 24 hours (Figure 2) . This enzyme is 
required for that last step in the synthesis of prostacyclin 
(PGI2) . PGI2 has been shown to induce vasodilation, inhibit 
smooth muscle proliferation and platelet aggregation. This 

10 indicates that PGI2 has atherogenesis inhibiting properties. 
Another relevant gene was natriuretic peptide receptor B 
(NPR-B) which is the receptor for C-type natriuretic peptide 
(CNP) , an endothelium dependent vasodilator. 

Further experiments demonstrated that these two genes 

15 were not regulated by estrogen in MCF-7 cells, an ER 

positive breast cancer cell line. Several other genes were 
examined for estrogen regulation in MCF-7 cells and were 
found to be regulated by estrogen in MCF-7 cells (ATP 
sulfurylase/ APS kinase, podocalyxin) , but others were 

20 either not expressed (connexin 37, diastrophic dysplasia 
sulfate transporter) or not regulated by estrogen in a 
similar fashion in MCF-7 cells (clusterin) . This 
demonstrates the usefulness of the VE-ERa cells in 
identifying genes regulated by estrogen in a cell type 

25 specific manner. 

Example 3 

Effect of Estrogen and Certain Estrogen-Related Compounds on 
Cyclin Al Expression 

30 This Example illustrates the effect of 17-p~estradiol 

(E 2 ) and diethylstilbestrol (DES) , an estrogen agonist, on 
cyclin Al expression. 

ERoc enhanced cells as described above were treated by 
adding 0.1% ethanol (ctrl) or lOOnM 17-pl-estradiol (E2) for 

35 various times. RNA was prepared using standard methods and 



RNase protection experiments were performed to evaluate the 
time course of cyclin Al induction. Figure 3 shows the mRNA 
levels of cyclin Al at the indicated times, with 
glyceraldehyde- 3 -phosphate dehydrogenase (GADPH) levels also 
5 provided for reference. The results demonstrate that cyclin 
Al mRNA is increased by treatment with estradiol within 3 
hours . 

In a similar experiment, RNase protection assays were ^ 
performed using estrogen receptor a- and estrogen receptor 

10 p-enhanced vascular endothelial cells. The estrogen 

receptor p -enhanced cells were prepared as described above 
for estrogen receptor a-enhanced cells, except that the 
retroviral vector expressed estrogen receptor P (Kuiper, G. 
PNAS 1996 93:5925-5930) Cells were treated by adding 0.1% 

15 ethanol (ctrl) , lOOnM 17 -p-estradiol (E2) , 1/iM ICI 164384 
(ICI) , or lOnM diethylst ilbestrol (DES) , and RNase 
protection assays were performed as described above. The 
results are presented in Figures 4A and 5B, which show the 
levels of various cyclins in estrogen receptor a- and 

20 estrogen receptor P-enhanced cells, respectively. Arrows 
indicate mRNA for each indicated cyclin. These results 
demonstrate that the estradiol induction of cyclin Al mRNA 
can occur with both estrogen receptor a- and estrogen 
receptor P-enhanced vascular endothelial cells. The "pure" 

25 estrogen antagonist ICI 164384 inhibits this response, 
whereas the synthetic estrogen DES mimics the estradiol 
effect. Cyclin Al is the only cyclin in the panel increased 
by estradiol treatment . 

Cyclin Al expression and the effect of estradiol on 

3 0 expression in a number of immortalized and primary cells 
lines was further evaluated as described above. These 
results are shown in Table II, in which ( + ) indicates 
detectable cyclin Al whereas (-) indicates no detectable 
cyclin Al expression. These results demonstrate that 



although cyclin Al is detectable in some other cell types, 

cyclin Al expression is only increased by estradiol in the £ 

vascular endothelial cells with enhanced estrogen receptor 

a- and estrogen receptor 1 

Table II 

Cyclin Al Expression and Induction in Various Cell Lines 



Cell Line 


Control 


Estradiol 


ERa-enhanced vascular 
endothelial* 




+ + + 


ERp-enhanced vascular 
endothelial* 


+ 


++ 


Vascular endothelial* 


+ 


+ 


Primary vascular endothelial 




+ 


Mammary epithelial* 


+ 


+/- 


MCF-7 (mammary cancer cell 
line) 






Endometrial epithelial* 


+ 


+ 


Osteoblasts* 






Vascular smooth muscle* 






Primary vascular smooth muscle 







Immortalized human cells 



Example 4 

Use of Estrogen-Regulated Markers for Identifying Agents 
with Estrogenic Activity 

15 This Example illustrates screens for estrogenic 

compounds using estrogen-regulated markers. 

An RNase protection assay (RPA) was used to examine a 
panel of genes identified as estrogen-regulated in VE-ERoc 
cells. The reference compounds 17-p-estradiol , 17a-- 

20 estradiol (inactive estrogen) ICI 164 384, tamoxifen, and 

raloxifene and were tested in a multigene assay with VE-ERot 
cells (5 dose - 0.1 nM, 1 nM, 10 nM, 100 nM, 1000 nM for 24 
hours As expected, 17-P-estradiol induced expression of 
representative estrogen-regulated genes (indicated in Figure 

25 5) . ICI 164,780, a compound generally regarded as a pure 



A 



estrogen antagonist, showed no induction of gene expression 
and lowered basal expression of some genes at higher jf 
concentrations. Raloxifene and tamoxifen showed no clear 
induction of any of the genes. In fact, raloxifene had f 
5 similar effects to ICI 164,780, repressing rather than 
inducing gene expression. 

In a further experiment, compounds known to have : 
estrogen- like activity based on ER binding and ERE 
transactivation (ERE luciferase reporter in Chinese hamster 

10 ovary cells expressing transfected ERct) were evaluated. The 
data with these compounds showed a strong correlation 
between ER binding activity and ERE transactivation. In 
general, the compounds that bound strongly to ER also were 
strong activators of transcription, judged by luciferase 

15 activity in the reporter assay. When these compounds were 

examined in the multigene RPA, some compounds showed results 
correlating with the ER binding and transactivation results 
(Table III, compounds D and E) . However, a number of 
compounds which were highly active in ER binding and in the 

2 0 ERE transactivation assay had very low activity in the 

multigene assay (Compounds A, B, C) . In addition, several 
compounds which had significant ER binding and minimal ERE- 
luciferase transactivation were strong activators in the 
multigene assay (Compounds F and G) . This suggests that the 

2 5 assay can detect compounds that are activating through 

promoter elements other than EREs . In order to confirm that 
these active compounds were working in an ER dependent 
manner, VE103-ERa cells were treated with a single dose of 
compound and increasing doses of ICI 164,384. The results 

30 of this experiment demonstrated that ICI 164,384 was able to 
inhibit competitively the activation of these genes, 
indicating ER dependence. Table III shows the results of 
compounds tested on prostacyclin 12 synthase in the VE-ERcx 
cells compared to ER binding and ERE transactivation assay. 

35 All results are presented as percent of estrogen. PGI2 



synthase numbers were based on EC5 0 from a 5 -point dose 
response like that shown for estrogen in Figure 5. 

Table III 

5 Results of Compounds tested on Prostacyclin 12 Synthase in 
the VE-ERcc Cells Compared to ER Binding and ERE 



Transactivation Assay 





Detected In 


Chinese Hamster 


VE-ERa Cells 




Ovary Cells 






Compound 


ER Binding 


ERE 


PG 12 






transactivation 


synthase 


17-p-estradiol 


100 


100 


100 


A 


199 


63 


6 . 5 


B 


170 


55 


2 .3 


C 


74 


130 


8 


D 


197 


70 


129 


E 


103 


101 


97 


F 


6.2 


0.4 


224 


G 


2.2 


0.2 


90 



From the foregoing, it will be appreciated that, 
10 although specific embodiments of the invention have been 
described herein for the purpose of illustration, various 
modifications may be made without deviating from the spirit 
and scope of the invention. Accordingly, the present 
invention is not limited except by the appended claims. 
15 Various publications are cited herein, the disclosures of 
which are incorporated by reference in their entireties. 
Various nucleic acids are referred to by accession number 
herein, the sequences of which are incorporated by reference 
in their entireties. 



